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© A yeast cell of the genus schwannlomyces. 

© The present invention relates to a yeast cell of the genus Schwanniomyces wherein said yeast cell contains 
at least one expression cassette comprising: 

a) a first DNA sequence serving as a regulon, 

b) optionally a second DNA sequence coding for a signal peptide, 

c) a third DNA sequence coding for a foreign protein, and 

d) optionally a fourth DNA sequence serving as a terminator. 

In a preferred embodiment of the invention, the first, second and/or fourth DNA sequences are derived from 
genes coding for amylolytic enzymes of yeast cells of the genus Schwanniomyces . The use of these functional 
sequences in an expression cassette provides efficient expression of foreign genes in yeast cells according to 
the present invention and, furthermore, provides a system for efficient secretion of the expression product into 
the medium. 
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Th present Invention relates to yeast cells of the genus Schwanniomyces and a process for the 

production of polypeptides. 

For expression of eukaryotic proteins of biotechnogical relevance there exists a variety of host systems, 
which are selected in compliance with the specific requirements. The most frequently used host organism is 
Escherichia coli, being the best known representative of prokaryotic host organisms; Saccharomyces 
cerevisiae anTanimal tissue cultures are used as eukaryotic host cells. For certain purposes, e.g. where 
expression of a glycoprotein is desired, expression in E.coll or other prokaryotic organisms is not 
appropriate, since same lack any glycosylate capability. On the other hand, the mass production of 
protein in tissue cultures still is cumbersome and expensive; therefore, the organism of choice in many 
instances is yeast of the well characterized genus Saccharomyces . S. cerevisiae is able to secrete 
efficiently small proteins, e.g. polypeptides like MQ1 {Kurjan and Herskowitz, 1982). jS-endorphin or a- 
interferon (Bitter et al.. 1984). however proteins of higher molecular weight are captured in the periplasmic 
space (Emr et al. 1981) or the cytoplasma. thus causing decreased growth rates or even death of the cell. 

Therefore it is an object of the present invention to provide an improved eukaryotic expression system, 
which is optionally capable of secreting proteins encoded by a foreign DNA Into the culture medium. 

This object has been solved by a yeast cell of the genus Schwanniomyces , wherein said yeast cell 
contains at least one expression cassette comprising : 

a) a first DNA sequence serving as a regulon, 

b) optionally a second DNA sequence coding for a signal peptide 

c) a third DNA sequence coding for a foreign protein, and 

d) optionally a fourth DNA sequence serving as a terminator. 

The invention, which comprises further subjects, is now described in a more detailed manner by the 
following description, examples and figures. 

BRIEF DESCRIPTION OF THE FIGURES 
Fig.1 

A: Restriction map of the AMY1 gene encoding a-amylase and its 5 and 3 ^ noncodmg region 
B: Nucleotide sequence of the AMY1 gene and its flanking region. In the 5 region the transcription 
initation site is indicated by a dashed line above the sequence. Within the structural gene the presumptive 
signal peptidase cleavage sites are indicated by star-symbols. The two potential glycosylation sites, the 
cystein residues presumably forming four disulfide bonds (concluded by homology to the A. oryzae 
enzyme) are underlined. Within the 3' end region the transcription termination signal TAG...TATGT...TTT is 
indicated by a solid line. 

Fig-2 , 
A: Restriction map of the GAM1 gene encoding glucoamylase and its 5 and 3 noncodmg region 
B: Nucleotide sequence of the GAM1 gene and its flanking region. In the 5 region the transcription 
initiation site is indicated by a dashed line. Within the structural gene the presumptive signal peptides 
cleavage sites are indicated by star-symbols. 
Fig.3 

M13 Xhol. The EcoRI-Scal fragment, containing a 1.8 kb promoter fragment and approx. 0.4 kb structural 
gene fragment of a-amylase gene, was subcloned into M13mp8 previously digested with EcoRI and Smal. 
By site directed mutagenesis a Xhoi-site was inserted just in front of the ATG codon: 

** * 

5 ' ..TAAAATAAAAGCTCGAG A J G_A G A T J_T t . 3J. . 



Positions where bases have been exchanged by site directed mutagenesis, are indicated by asterisks. 

Fig.4 , . , 

pMaGAM-B/S. Th Bglll fragment containing the glucoamylase gene and its 5 and 3 flanking region was 
subcloned into the BamHI site of vector pMAC5-8 (Kramer et al.. 1984). By site directed mutagenesis using 
the gapped duplex DNA methode (Kramer et al., 1984) a BamHI site and a Sail site were inserted just in 
front of the translation initiation codon ATG: 
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* ****** ** 

5' . . CTCAT6ACTGTGTCGACGGATCCAAG ATG^^IJJ-XIi-s * 

Positions affected by the site specific mutagenesis are indicated by asterisks. 

YRp7oQAM. The GAM gene was isolated from pMaGAM-B/S as a Sail fragment and ligated into the Xhoi 
site of M13aXho (Fig.3). The Bglil fragment harbouring the a-amylase promoter GAM gene fusion was 
ligated into the single BamHI site of the S. cerevisiae vector YRp7 resulting in plasmid YRp7aGAM. 

Construction of S. occidentaiis vectors for expression of foreign genes using the promoter and the signal 
sequence of the glucoamylase gene. 

a- Insertion of the 4 0 kb EcoRI-Pvuil fragment from pBRSwARSGAM, containing the complete 
GAM promoter and the first 208 bp of the GAM encoding sequence into pRR322, cleaved with EcoRI/Pvull, 

resulting in vector pBRGAM. 

b* Insertion of the 3.4 kb Pvull fragment from plasmid pCT603, containing the structural gene 
coding for celiulase from base pair position +112, into the Pvull site of plasmid pBRGAM, resulting in 

vector P B c R ^ rtjon Qf 5 5 kb Bg|I| . Pst | fra g m ent from plasmid pBRGCI into BamHI/Pstl cut pCJD5-1. 
The resulting plasmid, pMPGC1-1 contains 321 bp of the GAM 5 non coding region and the first 208 bp of 
the GAM coding region fused to position + 1 12 of the celD gene. 

d- Insertion of the 6.5 kb BamHI-Pstl fragment from plasmid pBRGCI into BamHI/Pstl cut pCJD5- 
1. The resulting plasmid. pMPGC1-2 contains 1.3 kb of the GAM 5 non coding region and the first 208 bp 
of the GAM coding region fused to position +112 of the celD. 

Promotefstudies using the qualitative endoglucanase D activity assay (congo red staining) with different 

transformants: n 
NGA58: pMPAGC, NGA58: pMPPGC, NGA58: pMPGCM. NGA58: pMPGC1-2 
a: Growth on 2% maltose 
b: Growth on 4% glucose 
Fig 8 

Construction of a S. occidentaiis vector for expression of foreign genes under control of the ADH1 Promoter. 

a: Insertion" of the 1.45 kb BamHI-Sail ADH1 promoter fragment of plasmid pFM2-1 into pCJDS-1. 
previously cut with BamHI/Sall resulting in plasmid pMPADHL ^ AP ^ w • 

35 b: Insertion of the 3.2 kb Sail fragment of pJDcg-15 into the Sail site of pMPADHL resulting in 

PMPAG *c: Exchange of the GAM promoter in plasmid pMPGC1-2 (Fig 6d) for the ADH1 promoter from 
plasmid pMPAG by in vivo recombination, resulting in plasmid pMPAGC. 

40 Construction of a S. occidentaiis vector for the expression of foreign genes under control of the PDC 
promoter ^ ^ q q kb Sph ,. Ava) fragment from p i as mid pJDcg-15. containing the PDC promoter 

fused to the GAM gene into YRpJD2 previously cleaved with Sphl-Aval. resulting in plasmid pMPPG 

b: Exchange of the GAM promoter in plasmid pMPGCl-2 (Fig. 6d) for the PDC promoter from 
plasmid pMPPG by in vivo recombination, resulting in plasmid pMPPGC. 

ConstrSion°of an autonomous replicating vector (containing SwARS2) for transformation of S. occidentaiis 
The 3.2 kb TRP5 BamHI fragment from plasmid YRpJD2 was ligated with the 5.5 kb BamHI-BglM fragment 
of plasmid pBRSwARSGAM (Fig 6a). resulting in pMPTS2-A and pMPTS2-B. 

ABBREVIATIONS USED THROUGHOUT THE APPLICATION 
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Restriction endonucleases 

Aa-Aatil 
A - Asp718 



3 



EP 0 394 538 A1 



B - BamHI 

Bg - Bglll 

C - Clal 

E - EcoRI 
5 EV - EcoRV 

H - Hindlll 

P - Pvull 

Ps - Pstl 

S - Sail 
w Sc - Seal 

Sm - Smal 

Sp - Sphl 

X-Xhoi 

Throughout this application various publications are referenced by mention of the first author and the 
15 year of publication within parentheses. Full citation of the references may be found at the end of the 
Specification as an annex, listed according to their alphabetical order immediately following the specifica- 
tion. The disclosures of these publications in their entireties are hereby incorporated by reference into this 
application in order to more fully describe the state of the art as known to those skilled therein as of the 
date of the invention described and claimed herein. 
20 The yeast molecule according to the present invention is characterized by containing an expression 
cassette which in turn is composed of three to four different components, being arranged in operable 
linkage in order to ensure efficient expression of a desired polypeptide, optionally followed by secretion of 
same. These components are defined to be the following : 

a) A first DNA sequence serving as a regulon. The term "a regulon" as used throughout the 
25 application comprises any cis-acting DNA sequence involved in the regulation of expression of a given 

structural gene. The term thus embraces sequences preceeding a given coding sequence, for example, a 
promoter and sequences which are recognised by transcription factors or other proteins involved in 
transcription regulation. The "first DNA sequence" is thus not limited to sequences corresponding to known 
promoters. Also included are DNA sequences corresponding to natural occuring regulons, which have 
30 undergone insertion, deletion or substitution events, but still retain their activity as regulon. Further the term 
"regulon" includes DNA sequences involved in regulation of transcription, which have been synthesised 
chemically or obtained by methods of gene technology. 

b) An optional second DNA sequence coding for a signal peptide. The meaning of this second DNA 
sequence includes various DNA sequences, directing the secretion of any polypeptide containing the thus 

35 encoded peptide at its N -terminal end. 

c) A third DNA sequence coding for a foreign protein. This coding sequence coding for the signal 
peptide in order to retain a proper reading frame for the third DNA sequence. The fusion thus requires 
application of methods or recombination DNA technology known to every person skilled in the art. For 
methods enabling to design proper fusions it is in general referred to Maniatis et al. (1982), wherein 

40 appropriate methods, e.g. various ways to treat restriction fragment ends, cloning of linker molecules and 
similar procedures are described in great detail. 

The foreign protein coded for by the third DNA sequences may be any protein which to express there 
exists some need. Since the secretion system is capable of secretion not only of small molecules, but also 
of large proteins, expression of any desired protein should be possible. For examples it is referred to the 
45 discussion below. 

d) Optionally a fourth DNA sequence serving as a terminator. As the terminator any sequence may 
be used, which efficiently terminates transcription in the respective host organism. Terminators may be for 
example sequences prone to formation of hairpin structures and/or polyadenylation sites. In a preferred 
embodiment the terminator is derived from the same gene as the second DNA sequence, i.e. the regulon, is 

so derived from. 

The four components are combined with each other using known techniques. Optionally the proper 
fusion will be v rified for example by sequencing the boundaries. 

The yeast cell according to the present invention contains at least on expression cassette comprising 
some or all of the above mentioned DNA sequences. It provides a convenient tool for the industrial 
55 expression of any desired protein. 

In a preferred embodiment said first and/or second and/or fourth DNA sequence ar derived from a 
gene coding for an amyiolytic enzyme, for example a-amylase or glucoamylase, which nzymes in most 
organisms are induced upon contact with starch. The DNA sequences may be derived from a yeast of a 
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memb r of on of the genera Debariomyces . Saccharomyces , Upomycas . Plchia , or Saccharomyces . but 
yeast f the genus Schwannlomyces ar preferred. 

Yeast species of the genus Schwanniomyces have an fficient expression and secretion apparatus, 
enabling them to grow on starch as a sole carbon source. Starch is hydrolysed to glucose by two amylolytic 
enzymes, secreted Into the supernatant, namely a-amylase (a- 1,4-glucan 4-glucanohydrolase E.C.3.2.1.1) 
and glucoamylase (syn. amyloglucosidase) (a-1 ,4-glucanglucohydrolase. E.C.3.2.1 .3.; with debranching 
activity E.C.3.2.1 .9.). The amylolytic enzymes of Schwannlomyces occldentalls , formerly also called 
schwanniomyces castellii and Schwanniomyces alluvius (Price et al., 1978), are well documented (Oteng- 
Gyangetal., 1981; Sills et al., 1982. 1984a. 1984b; Wilson etal.. 1982). 

The genes for a- amylase and glucoamylase from a yeast of the genus Schwanniomyces have been 
disclosed by the present inventors in EP 87 110 370.1. However, at that time the DNA or peptide 
sequences directing secretion have not been identified. Furthermore their use for the establishment of an 
efficient expression and secretion system in yeast was not known then. 

It is preferred, to use as a first DNA sequence part or all of a 1.8 kb Bglll-Xhol fragment which 
proceeds the structural gene coding for the Schwanniomyces a-amylase. or part or all of a 1.3 kb BamHI- 
Pvull fragment proceeding the structural gene coding for the Schwanniomyces occidentalis glucoamylase. 
Further if only smaller fragments are to be used, it is preferred, to use a DNA sequence comprising part or 
all of the DNA sequence corresponding to bases -1 to -540 of the region proceeding the structural gene 
coding for a-amylase or part or all of the DNA sequence corresponding to bases -1 to -320 of the region 
preceding the structural gene coding for glucoamylase. The respective DNA fragments are fully active as 
promoters, inducible by starch, dextrin, maltose. 

It is obvious, that replacement of bases or base pairs, not involved in the transcriptional regulation, still 
are within the spirits of the present invention. Thus natural mutations or synthetic equivalents also are 
embraced by the present invention. 

In yeast cells of the genus Schwanniomyces it is also possible to use promoters derived from yeast 
genes obtained from many other yeast genera. Preferred are regulons involved in the expression of ADH1, 
ADH2. PDC1. GAL1/10. PGK and GAPDH, or LAC4 which are obtained from Saccharomyces cerevisiae . 
Kluyveromyces Lactis or Schwanniomyces occidentalis . 

FTJrtTifTn^eTitls possible to use viral regulons and/or terminators, for example regulons and termina- 
tors obtained from E.coli T-phages. The phages possess extraordinarily strong functional sequences for 
performance of theirTytic functions. In a preferred embodiment sequences of phage T7 are used, which are 
regulated by expression of T7 RNA polymerase. 

In order to increase the over all expression level and furthermore, in order to facilitate purification of 
produced polypeptides, it is most desirable, that the polypeptides are secreted into the culture medium. 
The present invention provides signal sequences having the invaluable advantage of being denved from 
genes of amylolytic and thus inducible enzymes, which upon induction have to be secreted very efficiently. 
A further advantage is. that induction of amylolytic enzymes does not require complicated media, lacking 
for example a specific amino acid or phosphate, but only require the culture to be transferred to a starch 
containing medium. Signal sequences, which are preferably used for secretion of large proteins include part 
w or all of at least one of the following peptides: 

a) MetArgPheSerThrGluGlyPheThrSerLysValValAlaAlalle LeuAlaPheSerArgLeuValSerAla; 

b) MetArgPheSerThrGluGlyPheThrSerLysValValAlaAlalle LeuAlaPheSer- 
ArgLeuValSerAlaGlnProllellePheAspMetArg; 

c) MetllePheLeuLysLeulleLysSerlleVallleGlyLeuGlyLeu ValSerAla; 

45 d) MetllePheLeuLysLeulleLysSerlleVallleGlyLeuGlyLeu ValSerAlalleGlnAlaAlaProAla; 

e) MetllePheLeuLysLeulleLysSerlleVallLeGlyLeuGlyLeu ValSerAla- 

HeGlnAlaAlaProAlaSerSerlleGlySerSerAlaSerAla 

In a further embodiment the above mentioned peptides are coded for by DNA sequences, which 

correspond to all or part of the following DNA sequences: ^„.. rW r,-AA/-»A-rr 

a) ATGAGATTTrCAACTGMGGATTTACAAGTAMGTTGTTGCAGCMTTTT AGCATTCTCAAGATT- 

^^^b^^GAGATTTTCMCTGMGGATTTACAAGTAMGTTGTTGCAGCMTTT TAGCATTCTCAAGATT- 

GGTATCTGCTCAACCGATTATTTTTGACATGAGA; 

c) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA GTTAGTGCT; 

d) ATGATTTTTCTGMGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA GTTAGTGCTATCCAAG- 

CAG °e) CC A?GATTmCTGAAGCTGATTAAM GTTAGTGCTATCCAAG- 
CAGCCCCTGCCTCTTCGATTGGATCTAGTGCTTCA GCA. 
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It will be understood, that the replacement of amino acids by residues having comparable properties 
and use of alternative codons in compliance with the possibilities provided by the genetic code will not have 
any impact on the yeast cell according to the present invention. 

The polypeptide to be expressed in a yeast cell according to the present invention is coded for by a 
5 third DNA sequence. Said third DNA sequence may comprise natural or synthetic DNA, even DNA 
containing intervening sequences. Examples for proteins being expressed by the yeast cell according to the 
present invention comprise proteins of enormous importance for scientific or medical purposes, for example 
cellulase, interleukine, interferon, insulin like growth factor, lymphokine, human growth factor, nerve growth 
factor, aprotinin, insulin, hirudin, hormones, blood clotting factors, hepatitis B surface or core antigens, viral 
w or bacterial vaccines or human granulocyte/macrophage colony stimulating factor. 

The yeast cell according to the present invention may optionally comprise a terminator as component of 
said expression cassette. Preferred terminators are derived from genes of one of the following genera : 
Saccharomyces, Pichia. Hansenula . but preferably Schwannidmyces . Examples of preferred terminators are 
those derived from Schwanniomyces a-amylase gene or Schwanniomyces glucoamylase gene, part or all of 
15 which my be used. Such terminators are comprised by nucleotides 1537 to 1740 of the Schwanniomyces a- 
amylase gene or nucleotides 2875 to 3320 of the Schwanniomyces glucoamylase gene. 

It is obvious, that in case of any of the DNA sequences mentioned modifications, which do not impair 
the biological function of the respective DNA fragment still are covered by the present application. There 
are numerous possibilities to change the DNA sequences mentioned above slightly without impairing the 
20 biological function of said fragment gene. 

The yeast cell according to the present invention, carrying an expression cassette composed of the 
above discussed components, may be carried by a circular or linear vector, which optionally comprises one 
or more selective marker genes. The present inventors demonstrated, that a lot of marker genes so far used 
for transformation of Saccharomyces , for example TRP5, LEU2. ADE1. ADE2. HIS3. H1S4, URA3, LYS2. 
25 which may be obtained from Saccharomyces . Hansenula , Pichia or Schwanniomyces . are functional in 
Schwanniomyces occidentalis. Furthermore it is also possible to use the amylolytic properties ofa-amylase 
or glucoamylase in order to select transformed organisms. The corresponding genes have been isolated 
and characterized. AMY1 and GAM1 are preferably obtained from Schwanniomyces . 

The yeast cell according to the present invention may contain the expression cassette as an insert into 
30 one of the endogenous yeast chromosomes. Furthermore, there is however the possibility, that a DNA 
sequence capable of controlling autonomous replication and even also stable maintenance of the vector in 
the host organism, is introduced into the yeast cell. Said DNA sequences are the so called ARS sequences, 
capable of controlling autonomous replication and stable maintenance in the respective host organism. Said 
DNA sequences are preferably derived from Saccharomyces . Pichia Hansenula or Kluyveromyces , most 
35 preferred from Schwanniomyces . 

In order to ensure effective replication in the prospective host organism, it is preferred to use an ARS, 
derived from the respective host organism. 

The present invention provides one new SwARS sequence. The DNA sequence SwARS2 is capable of 
controlling autonomous replication and stable maintenance of the vector in a Schwanniomyces yeast cell. 
40 The detailed sequence of SwARS has not yet been determined. SwARS2 is localised in the Schwan- 
niomyces genome in the 3' region of the glucoamylase gene, as shown for example in Fig. 10. SwARS2 is 
easy to identify and efficiently allows autonomous replication of any piasmid. exhibiting this sequence. The 
so far known Schwanniomyces autonomously replicating sequences, SwARSI and SwARS2 are comprised 
on a Clal-BamHI fragment (SwARSI) adjacent to the EcoRI-fragment shown in Fig 2a (see Fig. 10) and/or 
45 an EcoRi-Bglll fragment contained in the EcoRI fragment shown in Fig 2a (SwARS2). 

For the SwARS sequences it is also not necessary, in any case to use the sequence as it occurs 
naturally. DNA sequences, that have been obtained by any kind of modifying treatment, for example by 
deletion, insertion or substitution of one or more nucleotides or base pairs, are still within the scope of the 
present invention, provided the modifications retain or improve the biological function of the yeast cell 
so according to the present invention. 

In a preferred embodiment the yeast cell according to the present invention comprises a DNA 
sequence, which is homologous to genomic Schwanniomyces DNA. Provision of structures of homologous 
DNA enable the DNA fragment flanked by such homologous sequences to be inserted by homologous 
recombination. It is thus preferred to have homologous sequences, which do not represent any essential 
55 gene of the Schwanniomyces genomic DNA, but rather any gene, whose expression product may be 
supplemented by additives to the respective medium or were lack of the respective expression product 
does not have any serious impact on the transformed yeast cell. 

If no SwARS sequence or any equivalent sequence, for example originating from any other yeast 
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species, is available, the expression cassette can be used without an autonomously replicating agent. In this 
case the expression cassette tends to integrate into th genome, for example via homologous recom- 
bination. Preferably there are several copies inserted into on or more chromosomes. 

In this case it is preferred, to have multiple copies of the expression cassette inserted, in order to. 

s ensure simultaneous transcription and translation of several copies. 

When the yeast cell according to the present invention is derived from a yeast of the genus 
Schwanniomyces the polypeptide products observed following expression of the respective foreign gene in 
said yeast cells exhibit glycosylate patterns, which closely resemble or even match those of the desired 
eucaryotic protein. Yeast cells of the genus Schwanniomyces are thus preferred for expression of 

jo eucaryotic proteins. 

The above discussed yeast cell according to the present invention can be used in a process for the 
production of a polypeptide, wherein a yeast host organism is cultivated under suitable conditions and the 
polypeptide is recovered in a manner known per se. The conditions of cultivation depend on the respective 
host organism used. Examples for cultivation conditions for yeast of the genus Schwanniomyces are 
is provided in the examples below. The polypeptide, produced by the yeast according to the present 
invention, is recovered by conventional purification methods, which depend on the properties of the 
expressed protein, the availability of specific absorbing agents and so on. 

In a preferred embodiment yeast cells of the species Schwanniomyces occidentalis are used as a host 
organism and are cultivated on starch.dextrin. maltose and/or plant biomass as the respective carbon 
20 source. As commonly known, these carbon sources are available at low costs. 

Furthermore, there is a possibility to use cells of the genus Schwanniomyces for the production of 
single cell protein. The single cell protein may then be subjected to any further use as it is required. 

In a preferred embodiment the polypeptide produced in a process according to the present invention is 
an anxiolytic enzyme, for example an a-amylase or glucoamylase. These enzymes are commercially used 
to debranch and degrade starch, which thereupon may be further processed, for example m brewery or 
baking processes. There is a high need for the provision of starch degrading enzymes. 

In a further embodiment of the process according to the present invention production of a polypeptide 
may be achieved by providing expression cassettes as discussed above containing a yeast a-amylase or 
glucoamylase signal sequence and inserting the respective expression cassette into a yeast of one of the 
genera Saccharomyces. Kluyveromyces , Hansenula . Pichia or Schizosaccharomyces . Production and secre- 
tion of foreign proteins in the respective yeast host organism is obtained by applying the appropriate 
inductive measure, whereupon the produced protein is secreted due to the presence of the yeast a-amylase 
or glucoamylase signal sequence. Since it is known from experiments performed by the present inventor, 
that these signal sequences are recognised in virtually any yeast genus, the a-amylase or glucoamylase 
signal sequences may be applied in many different yeast with success. Also in this case, it is preferred, to 
the a-amylase or glucoamylase signal sequences derived from the respective genes of Schwan- 
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niomyces occidentalis. . 

irTIU mmary the y east cells according to the present invention are well surted for transformation and 

expression of any desired foreign gene . As already outlined above, expression of procaryotJc as well as 
eucaryotic genes is possible. However, for expression of eucaryotic proteins yeast cells of the genus 
Schwanniomyces containing some or all of the DNA sequences mentioned above are the most preferred 
yeast cells. 

CHARACTERISATION OF THE a-AMYLASE AND GLUCOAMYLASE GENE EXPRESSION IN DIFFERENT 
YEAST GENERA : 

The N-termini of the deduced amino acid sequences of the a-amylase gene (Fig.lB) and glucoamylase 
gene (Fig.2B) exhibit the features of typical leader sequences of secretory proteins (Von Heyne 1983. 
Periman and Halvorsson 1983). The N-terminal amino acids of mature a-amylase secreted by S. occidejv 
talis are determined to be AspValSer. This indicates processing between Arg33 and Asp31 which is unusual 
IFdnot according to the Von Heijnes model of site specificity for signal peptidase cleavage. This putabve 
cleavage site was confirmed also for a-amylase expressed in S. cerevisiae . Pichia stipjdis.K. lactis. S. 
pombe and H. polymorpha . However, in these yeast species processing is also found to occur b tween Ala 
25 and Gin 26. resulting in the N-terminal sequence GlnProlle. 

«-amylase isolated from a culture of S. occidentalis migrates as a single band of 55 kDa on SDS 
polyacrylamide gels: following EndoglucosiJase H treatment and subsequent gel electrophoreses a single 
band of 54 kDa is observed, indicating that only one of the two potential N-glycosylaoon sites of the protein 
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is used for glycosylation. It is noteworthy that high mannose glycosylate, as very often found by proteins 
secreted in S. cerevlsiae is not accompanying secretion in & occidentalis , 

IDENTIFICATION OF THE TRANSCRIPTION INITATION SITES AND PROMOTER FUNCTION OF THE a- 
AMYLASE AND GLUCOAMYLASE GENE. 

By S1 mapping of polyA-mRNA (Sharp et al.. 1980) the transcription initiation site of the a-amylase 
gene could be localised to be in the region of -42 to -30 (indicated by an interrupted line above the relevant 
sequence in Fig.lB) and the transcription initiation site of the glucoamylase gene was determined to be in 
the region -7 to -10 (underlined in Fig.2B). 



CONSTRUCTION OF AN a-AMYI_ASE - GLUCOAMYLASE HYBRID GENE 

In order to facilitate use of the a-amylase promoter and glucoamylase structural gene in further 
constructions additional restriction sites were inserted in front of the translation initiation codon of both 
genes using oligonucleotide directed mutagenesis as described in the AMERSHAM protocol. Correct 
mutagenesis was verified by sequencing according to Sanger (1977). The boundaries between the original 
Schwanniomyces genes and the inserted polylinker are shown below : 
a-amylase : 

** * 

5 1 . .TAAAATAAAA6CTCGAGATG_AGAJJJT. . . 3 9 . 

Glucoamylase: 



* ****** ** 

5 ' . . CTCATGACTGTGTCGACGGATCCAAG AJGJ5JQTJJJT tAJA 3 * . 

Sail BamHI 



The manipulations resulted in plasmids Ml3aXho (Fig.3) and pMaGAM-B/S (Fig.4). 

Thus, both the a-amylase promoter and the structural glucoamylase gene can be isolated from piasmid 
M13a Xho and pMaGAM-B/S as suitable DNA fragments. The GAM gene was isolated from pMaGam B/S 
as a 3.2 kb Sail fragment and ligated into the Xhoi site of M13aho. The Bgill fragment harbouring the a- 
amylase promoter/GAM gene fusion was ligated into the single BamHI site of the S. cerevisiae vector YRp7 
(Struhl et al., 1979) resulting in piasmid YRp7aGAM (Fig 5). S. cerevisiae strain YNN27 was transformed 
with this piasmid selecting for TRP prototrophy. Transformants secreted active glucoamylase into the 
culture supernatant (50 mU/ml) indicating that the a-amylase promoter fragment can direct expression and 
secretion of foreign proteins. 

SECRETION OF o-AMYLASE AND GLUCOAMYLASE IN DIFFERENT YEASTS USING SUITABLE PRO- 
MOTERS. 

Expression of the a-amylase and glucoamylase gene and secretion of the gene products can be 
detected in KJactis under control of the promoter of the a-galactosidase (LAW) gene (Breunig et al., 1984). 
Expression of the a-amylase and glucoamylase gene and secretion of the both proteins can also be 
detected in S. pomb under control of the ADH1 promoter (Russel et al., 1983) or of the GAL1/10 promoters 
(Johnston and Davis 1984). Aft r fusion with the GAL1/10 promoter (Johnston and Davis 1984). DHAS 
promoter (Janowicz et al., 1985), MOX promoter (Ledeboer et al., 1985) or FMDH promoter (EP 87 110 
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417.0) expression of the a-amylase and glucoamylase gene and secretion of both proteins can be detected 
in Hansenula polymorpha . 

EXPRESSION AND SECRETION STUDIES IN S. OCCIDENTALIS USING THE CELLULASE EN- 
DOQLUCANASE D (EGD) OF CLOSTRIDIUM THERMOCELLUM AS A MODEL FOR FOREIGN GENE 
EXPRESSION. 

As a first example for the expression of a foreign gene in S. occidentalis . the celD gene from C. 
thermocellum (Millet et al., 1985. Joliff et al.. 1986b), which codes for a thermostable cellulase, namely an 
endoglucanase (EGD) was used. Advantages in using this system are : 

- specific reaction of EGD. which can easily be monitored 

- a rapid qualitative assay for colonies with EGD activity (see material and methods) 

- availability of antibodies against EGD for specific detection of translation products. 

CHARACTERISATION OF THE GLUCOAMYLASE PROMOTER AFTER FUSION WITH THE celD GENE 

Plasmids were constructed containing a replicon from S. occidentalis (SwARSI) (EP 87110370.1) the 
TRP5 gene from S. cerevisiae (Zalkin and Yanowski,1982) as a selective marker and, in addition, a 
GAM/celD gene fusion under the control of different promoters. In the first step the 4.0 kb "oRI-Pyull 
fragment from plasmid pBRSwARSGAM (Fig. 6a) was isolated and inserted into the 2296 bp PBR322 
EcoRI-Pvull fragment, containing the amp gene and the bacterial origin, resulting in plasmid pBRPGAM 
(Fig 6a) In addition to the p8R322 seance this plasmid carries 3.6 kb from the 5 non coding region of 
the GAM gene and the first 208 bp coding for the N-terminal part (including the signal sequence) of the 
qlucoamylase. A 3.4 kb Pvull fragment from plasmid pCT603 (Joliff et al., 1986) containing the coding 
region of the celD gene missing the 5 1 11 bp was inserted into the Pvull single site of pBRPGAM resulting 
in pBRGCI (Fig.6b). E. coll transformed with this plasmid results in weak Carboxymethylcellulase (CMCase) 
activity, analysed by~conio red staining (see material and methods) indicating that the glucoamylase 
30 promoter is slightly expressed in E. coli. For the construction of a S. occidentalis expressio n vector the 
plasmid pCJD5-1 (EP 871 l0370.lTwis cleaved with BamHI/Pstl and ligated with the 5.5 kb Bglll-Pstl 
fragment or 6.5 kb BamHI-Pstl fragment from pBRGCI, resulting in pMPGCM (Fig.6c) and pMPGC1-2 
(Fia 6d) respectively. Both plasmids share an in frame fusion of GAM/celD but differ in the length of the 5 
upstream region of the GAM gene. S. occidentalis strain NGA58 (trp5 ade) isolated after UV mutagenesis of 
35 strain NGA23 was transformed withlhese plasmids and transformants selected for tryptophan prototrophy 
and analyzed by means of congo red assay for cellulase activity. Transformants produced act.ve EGD when 
grown under inducing conditions in 2% maltose (Fig 7a). By analysis with the congo red assay no activity 
could be detected under repressed conditions in 4% glucose (F.g. 7b). NGA58:pMPGC1-1 and 
NGA58*pMPGCl -2 transformants secrete active EGD into the culture supernatant when grown in > maltose 
40 under induced conditions. The intact or complete glucoamylase promoter resides on plasmid PMPGC1-2 as 
shown by 20 times higher activity compared to the shorter promoter fragment in gene fusion of pMPGC 1-1 
(table 1) However the 0.3 kb promoter fragment still harbours regulatory units which lead to catabolite 
repression by glucose (FigJb). Comparison of the N-terminai GAM sequence to the signal peptidase 
cleavage sites of several other secretory proteins (Von Heijne 1983) reveals three potential signal peptidase 
45 cleavage sites after the amino acid residues Ala 19. Ala 25 and Ala 34. For determination of the N-term.nal 
amino acid of the mature glucoamylase the protein was isolated for N-terminal amino acid sequencing. 
However the N-terminus is blocked and therefore could not be sequenced. To confirm the correct cleavage 
site of the GAM signal sequence in S. occidentalis , the regions between the potential signal peptidase 
cleavage sites and the Pvull gene fusioTT site are deleted in the GAM/celD fusion by in vitro mutagenesis to 
so study the influence on secretion of active EGD. 

STUDIES OF THE USE OF SUITABLE S. CEREVISIAE PROMOTERS IN THE EXPRESSION OF FOREIGN 
GENES IN S. OCCIDENTALIS . 

The ADH1 promoter (Hitzeman et al.. 1981) was isolated as a 1.45 kb BamHI-Sall fragment from 
plasmid pFM2-1 (MUller et a.., 1987) and .igated into plasmid pCJD5-1 (EP 87 110370.1) < Mdwi}.«» 
BamHI and Sail resulting in pMPADHl (Fig.8a). The GAM gen was excised as a Sail fragment from 
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pJDcg-15 (see below) and Inserted into the single Sail site of pMPADHI (Fig. 8b) resulting in plasmid 
pMPAG (Rg. 8b). By the means of in vivo recombination (Ma et al., 1987) the GAM promoter from 
pMPGC1-2 (Fig 6d.) was replaced by the ADH1 promoter of pMPAG leading to plasmid pMPAGC (Rg. 8c). 
Plasmid pMPGC1-2 was linearized with BamHI; the unique BamHI site of pMPGC1-2 is localized between 
the TRP5 coding sequence and the glucoamylase promoter. The 2.75 kb Clal fragment of pMPAG 
containing the ADH1 promoter shares homologies to the TRP5 gene as well as to the GAM coding region 
up to position +208. For in vivo recombination NGA58 was transformed with BamHI linearized pMPGC1-2 
together with approx. 10 fold excess of Clal fragment using the transformation protocol of Ito et al., 1983. 

Recombinants were screened for their ability to hydrolyze CMC on 4% glucose. Under these conditions 
celD expression is repressed in transformants harbouring recircularized pMPGC1-2. From 146 transformants 
7 colonies expressing cellulase in the presence of 4% glucose were isolated. NGA58 containing the "in 
vivo" constructed plasmid pMPAGC expresses active EGD constitutively (see Rg. 7). This result dem- 
onstrates that the newly constructed plasmid pMPADHI serves as a vector for foreign gene expression. 

In another experiment a recombinant plasmid was constructed by in vivo recombination to study the 
expression of the GAM/celD gene fusion under control of the PDC promoter (Das and Hollenberg, 1982). 
Therefore the 6.0 kb Sphl-Aval fragment from ptasmid pJDcg-15 (EP 87 110370.1) carrying the 
glucoamylase gene fused to the PDC promoter was subcloned into the respective restriction sites of 
YRpJD2 (EP 87 110370.1) resulting in plasmid pMPPG (Fig. 9a). 

For in vivo recombination pMPPG was cleaved with Clal and used together with BamHI linearized 
pMPGC1~2 foTtransformation of NGA58. Homologous recombination results in plasmid pMPPGC (Fig. 9b) 
harbouring a GAM-celD fusion under control of the PDC promoter. This construction leads to an expression 
in Schwanniomyces occidentalis NGA58 inducible by glucose and partly repressable by maltose (Table 1). 
Transformants were analysed for expression of active EGD using the qualitative congo red assay (Rg. 7). 
The results indicate that the 0.3 kb GAM promoter fragment leads to regulated expression, however less 
efficient that the 1.3 kb fragment (for comparison see table 1 and Fig. 2A). Expression under the ADH1 
promoter leads to efficient constitutive expression. The expression level is comparable with that of induced 
expression directed by the 1 .3 kb GAM promoter. 

The fact that plasmids can be constructed by in vivo recombination is a proof for homologous 
recombination and hence offers the possibility for stable site directed integration into the genome of S. 
occidentalis, a prerequisite for the industrial production of foreign proteins. 

The expression of the a-amylase gene and glucoamylase gene and secretion of the both gene products 
in S. occidentalis could also be achieved under control of the phosphoglycerate kinase (PGK) promoter 
(Do"bson et al., 1982). glyceraldehyde-3-phosphate dehydrogenase (GAPDH) promoter (Holland and Holland 
1979), copperchelatin (CUP1) promoter (Karin et al., 1984, Butt et al., 1984) and alcoholdehydrogenase 2 
(ADR2 or ADH2) promoter (Russell et al., 1983, Beier et al.. 1985) or galactokinase (GAL1/10) promoter 
(Johnston and Davis. 1984). 



Construction of pJDcg-15 : 

The GAM gene was isolated as a 3.2 kb BamHI-Sall DNA fragment from plasmid pMaGAM B/S (Rg.4) 
and inserted into the respective restriction sites of plasmid pBM272 Johnston & Davis. 1984). The PDC 
promoter was isolated from plasmid pCP202 (Kellermann & Hollenberg, 1988) as a Sall-Sphl fragment and 
ligated into the respective single sites of vector YRp7 (Struhl et al., 1979). To be able to isolate the PDC1 
promoter as a BamHI fragment the 3.2 kb Sail fragment of PMaGAM B/S containing the GAM gene was 
fused to the PDC promoter via the single Sail site. The resulting BamHI fragment containing the PDC- 
promoter was ligated into the single BamHI site of pBM272 containing the GAM gene. The final plasmid is 
called pJDcg-15 (Rg. 8b). 

EXPRESSION OF HUMAN GRANULOCYTE/MACROPHAGE COLONY STIMULATING FACTOR (hGM- 
CSF) 

Th gene coding for hGM-CSF (Cantrelt et al., 1985) was obtained from British Biotechnology Ltd. as a 
Hindlll-EcoRI fragment Position +52 of hGM-CSF gene was fused to position + 102 of the GAM signal 
sequence under control of the ADH1 or GAM promoter. This fusion results in a hybrid protein in which the 
endopeptidase cleavage site of the GAM signal sequence is linked with a Met included in the synthetic 
gene offered by British Biotechnology Ltd.. followed by the first amino acid of mature hGM-CSF (Ala 18). 
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This fusion was inserted into a S. occidentaiis expression vector and transformed into NGA58. Western blot 
analysis revealed secretion of hGM-CSF protein. 



EXPRESSION OF INTERLEUKINE-2 

The gene coding for IL-2 (Taniguchi et al„ 1983) was obtained from British Biotechnology Ltd., as a 
EcoRI-Hindlll fragment. Position +61 of IL-2 gene was fused to position + 102 of the GAM signal 
sequence under control of the ADH1 or GAM promoter. This fusion results in a hybrid protein in which the 
endopeptidase cleavage site of the GAM signal sequence is linked with a Met included in the synthetic 
gene offered by British Biotechnology Ltd., followed by the first amino acid of mature IL-2 (Ala 21). This 
fusion was inserted into a S. occidentaiis expression vector and transformed into NGA58. Western blot 
analysis revealed secretion of IL-2 protein. 

ISOLATION OF AN ADDITIONAL SwARS ELEMENT AND A NEW SELECTABLE MARKER 

A new SwARS sequence (SwARS2), localized downstream of the coding region of the GAM gene, could 
be identified. To analyze the function of SwARS2 a vector was constructed containing the Trp5 gene and 
SwARS2 in pBR322, resulting in pMPTS2-A and pMPTS2-B, respectively (Fig. 10). For the construction of 
PMPTS2-A and pMPTS2-B the 5.5 kb BamHI - Bglll fragment of pBRSwARSGAM, constructed by insertion 
of the EcoRI-fragment shown in Fig.2a into the single EcoRI site of plasmid pBRSwARSI, (disclosed in EP 
87 110370.1), was isolated and ligated with 3.2 kb BamHI TRP5 fragment obtained from YRpJD2. High 
frequency transformation of S. occidentaiis is obtained using this plasmid. 

As an additional selectable marker the LEU2 gene of S. occidentaiis was isolated. The LEU2 gene, 
coding for isopropyl malate dehydrogenase, was cloned by functional complementation of the mutant AH22 
(his4leu2) (Hinnen et al., 1978) of S. cerevisiae by transformation with a cosmid genomic library from S. 
occidentaiis . 

The LEU2 gene was found to reside on a 9 kb EcoRi fragment of Schwanniomyces genomic DNA; tt s 

function could be abolished by insertion of the TRP5 gene of S. cerevisiae . The disrupted LEU2 gene was 
integrated into the genome of NGA58 by homologous recombination resulting in a new strain NGA581 
(TRP5. LEU2. and ADE). An analogous experiment with the cloned HIS4 gene of S. occidentaiis (EP 87 110 
370.1) led to a strain NGA58h (TRP5, HIS4, ADE). 

IMPROVED EXPRESSION OF HETEROLOGOUS GENES USING THE T7 RNA POLYMERASE AND T7 
PROMOTER AND TERMINATER FOR THE EXPRESSION OF FOREIGN GENES IN S. OCCIDENTALIS 

T7 RNA polymerase (Fuerst et al., 1986) can be expressed in S. occidentaiis after fusion of the 
structural gene (Dunn and Studier, 1984) using the BamHI restriction site near the translation initation codon 
with suitable promoters asa-amylase, GAM (0.3 kb fragment) or PDC and homologous integration into the 
genome via any homologous cloned gene, e.g. HIS4, LEU2, AMY1 and GAM. A suitable fragment, 
containing the fusion flanked by homologous cloned sequences can be isolated and used together with 
pCJDS-1 (Fig. 6c) for co-transformation of NGA58, selecting on YNB containing 0.5% casaminoacids. 
Colonies containing the integrated T7 RNA polymerase gene can be identified by Southern analysis after 
isolation of transformants mutated in the target gene. 

For the expression of foreign genes a S. occidentaiis expression vector was constructed containing 
SwARS2, the ColE replication origin of E. coli, a suitable selective marker for S. occidentaiis (e.g. TRP5, 
HIS4, LEU2) and a cassette containing the T7 polymerase promoter and terminator sequences isolated from 
plasmid pAR2529 (Fuerst et al., 1986), separated by the polylinker sequence from Ml3mp19. To study the 
efficiency of the T7 system we expressed a GAM/celD fusion (GAM signal sequence and structural celD 
gene obtained as a BamHI-Asp 718 fragment from pMPGC1-2) under control of the T7 promoter by 
insertion into the BamHI-Asp718 site of the polylinker sequence. Using this system active cellulase is 
secreted into the culture medium. 



MATERIALS AND METHODS 
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1 . The microorganisms used are as follows : 

Schwanniomyces occidentalis NGA23 : (DSM 3792) 
Schwanniomyces occidentalis NGA58 
Kluyveromyces Lactis SD1 1 : (DSM 3795) 
Schizosaccharomyces pombe LEU1-32, HIS5-303 (DSM 3796) 
Saccharomyces cerevisiae YNN27 (URA3-52, TRP1-298, GAL2 ) 

The Escherichia coli strain used is as follows E coli HB101 F-. hsd S20 (r B - m B •), recA13 , ara-14 , proA2 , 
lacY1, galK2, rpsL20 (Sm^, xyl-5 , mtl-1 , supE44 ) 



2. Culture media : 

Schwanniomyces occidentalis strain NGA58 (isolated by UV mutagenesis of strain NGA23) was grown 
at 30 C in YNB (0.67% yeast nitrogen base without amino acids) supplemented with appropriate amino 
acids and with 2% of soluble starch when indicated. 

S. cerevisiae strain YNN27 ( URA3-52 , TRP1-289 , GAL2 ) (Stinchomb et al 1980); K. lactis strain SD11 
TRPT (Das and Hollenberg, 1982); S. pombe strain LEU1-32, HIS5-303 (obtained from W.D. Heyer): These 
strains were grown under conditions as descirbed above for Schwanniomyces occidentalis with the 
replacement of 2% soluble starch by 2% glucose, where indicated. 

The culture media for S. occidentalis and Hansenula polymorpha were buffered with 0.2 M NaPO* 
buffer pH 6.2; those for S. cerevisiae and K. lactis were buffered with 0.1 M citrate buffer pH 6 and those for 
S. pombe with 0.05 M acetate buffer pH 6. 

~~The~pH optimum for the cellulase (Joliff et al., 1986a). a-amylase and glucoamylase (Wilson and 
Ingledew, 1982) is approx. pH 6. 

E. coli HB101 (Bolivar et al., 1977) was grown in LB medium (Maniatis et al., 1982) with penicillin G 

(150"ug7mi). 

3. Miscellaneous methods 

Plasmid DNA was purified either by CsCl gradient centrifugation (Maniatis et al., 1982) or by rapid 
alkaline extraction of plasmid DNA described by Birnboim and Doly (1979). 

Yeast crude extracts were prepared by the method of Senate (1979). Yeast minilysates were prepared 
by the method of Sherman et al., (1983). 

DNA fragments were isolated by the "low melting agarose" procedure as described by Gafner et al., 
(1983). Southern hybridization was carried out as described by Southern (1975). 

Yeast transformation was performed according to Klebe et al. (1983) or Ito et al. (1983). The 
transformants or integrants were tested for expression and secretion of cellulase using either the qualitative 
congo red assay (Teather and Wood. 1982: halo formation on carboxymethylcellulose agar plates) or a 
quantitative cellulase assay according to Joliff et al. (1986a). One unit (U) is defined as the amount of 
enzyme that released 1 umol para-nitrophenol per min at 60° C. Western analysis was performed as 
described by Towbin et al. (1979). 

The transformants or integrants were tested for expression and secretion of a-amylase using either a 
standardized a-amylase enzyme test from Merck, W. Germany or a qualitative starch degradation test (halo 
formation on starch agar plates after staining with iodine), as follows : in this test the rate of formation of 2- 
chloro-4-nitrophenol is determined by photometry at 405 nm in 0.1 M potassium phosphate at 37 C. The 
enzyme unit (U) is defined as the amount of enzyme catalysing the formation of 1 umol 2-chloro-4- 
nitrophenol per min at 37* C. 

Glucoamylase activity was measured by a stop assay method: 
After incubation of the sample in 10% soluble starch in 0.05M KHaPCVNaOH (pH 5.0) at 50 C the amount 
of glucose produced is determined by the glucose dehydrogenase method (system glucose Merck). The 
quantity of NADH formed is proportional to the glucose concentration. The enzyme unit (U) is defined as 
the amount of enzyme catalysing the formation of 1 umol glucose/min at 50 C. 
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TABLE 1 



Transformant 


OD60O nm 


EDG-activity 
mil/ml 


NGA58: 


2.3 


0.12 


pMPGCl-1 


5.3 


0.73 


(0.3 kb promoter) 






NQA58: 


2.0 


0.53 


pMPGCl-2 


7.9 


1.83 


(1 .3 kb GAM promoter) 






NGA58: 


3.0 


0.52 


pMPAGC 


5.9 


1.07 


ADH1 promoter 






NGA58: 


2.2* 


0.40* 


pMPPGC 


4.5* 


0,64" 


PDC1 -promoter 


2.9 


0.19 




7.2 


0.31 


Transformants were grown in YNB, 0.5% 
Casamtnoacids, 2% maltose, 20 mg/l adenine and 
0.2 M NaP buffer pH 6.2 



* : values for transformants grown in 4% glucose 

25 — 
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Claims 

1 . A yeast ceil of the genus Schwanniomyces , 
characterized in that 

said yeast cell contains at least one expression cassette comprising : 

a) a first DNA sequence serving as a regulon, 

b) optionally a second DNA sequence coding for a signal peptide 

c) a third DNA sequence coding for a foreign protein, and 

d) optionally a fourth DNA sequence serving as a terminator. 

2. The yeast cell according to claim 1 . 
characterized in that 

said first and/or second and/or fourth DNA sequence are derived from a gene coding for an anxiolytic 
enzyme. 

3. The yeast cell according to claim 2. 
characterized in that 

said amylolytic enzyme is an a-amylase or glucoamylase. 

4. The yeast ceil according to any of claims 1 to 3, 
characterized in that 

said first and/or second and/or fourth DNA sequences are derived from a yeast of a member of the genera 
Debariomyces. Saccharomycopses , Lipomyces , Pichia , or Saccharomyces , preferably Schwanniomyces . 

5. The yeast cell according to claim 4, 
characterized in that 

said yeast is Schwanniomyces occidentalis . 

6. The yeast cell according to any of claims 1 to 5, 
characterized in that 

said first DNA sequence is comprised by a 1.8 kb Bglll-Xhol-fragment prece ding the structural g ne 
coding for the Schwanniomyces a-amylase or the 1.3 kb BamHl-Pvull fragment preceeding the structural 
gene coding for the Schwanniomyces occidentalis glucoamylase. 

7. The yeast cell according to claim 5, 
characterized In that 
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said first DNA sequence is comprised by part or all of the DNA sequence corresponding to bases -1 to -540 
of the region proceeding the structural gene coding for a-amylase or part or all of the DNA sequence 
corresponding to bases -1 to -320 of the region preceeding the structural gene coding for glucoamylase. 

8. The yeast celi according to any of claims 1 to 5, 
5 characterized In that 

said first DNA sequence corresponds to one of the following regulons : 

a) ADH1.ADH2, PDC1, GAL1/10, PGK, QAPDH, the regulon being preferably obtained from Sac- 
c haromyces cerevisiae 

b)~LAC4 the regulon being preferably obtained from Kluyveromyces lactis, or 

70 c) corresponding regulons, preferably obtained from Schwanniomyces . 

9. The yeast cell according to any of claims 1 to 5, 
characterized In that 

said first and/or fourth DNA molecule corresponds to a viral promoter and/or terminator, preferably from E. 
coli phage T7. 

75 10. The yeast cell according to any of claims 1 to 9, 

characterized in that . 

said second DNA sequence is coding for all or part of at least one of the following peptides : 

a) MetArgPheSerThrGluGlyPheThrSerLysValvAIAIaAlaiieLeuAlaPheSerArgLeuValSerAla; 

b) MetArgPheSerThrGluGlyPheThrSerLysValValAlaAlalle LeuAlaPheSer- 
20 ArgLeuValSerAlaGlnProllellePheAspMetArg; 

c) MetllePheLeuLysLeuIleLysSerlleVallleGlyLeuGlyLeu ValSerAla; 

d) MetllePheLeuLysLeulleLysSerlleVallleGlyLeuGlyLeu ValSerAlalleGlnAlaAlaProAla; 

Q j MetllePheLeuLysLeulleLysSerlleVallLeGlyLeuGlyLeu 

ValSerAlalleGlnAiaAlaProAlaSerSeriieGlySerSerAlaSerAla 
25 11. A yeast cell according to claim 10, 
characterized in that 

the DNA sequence coding for said peptide corresponds to part or all of one of the following DNA 
^^TG^^ TTAGCATTCT CAAGATT- 

30 GQTA ™T<^ TAGCATTCTCAAGATT- 

GGTATCTGCTCAACCGATTATTTTTGALATG AG A; 

C) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA G 

d) ATGATTTTTCTGMGCTGATTAAMGTATAGTMTTGGTTTGGGATTA GTTAGTGCTATCCAAG- 

35 t"**^^ GTTAGTGCTATCCAAG- 
CAGCCCCTGCCTCTTCGATTGGATCTAGTGCTTCA GCA. 

12. The yeast cell according to any of claims 1 to 1 1 , 

40 SdtSoNA leqllence coding for a foreign protein is a natural or synthetical DNA sequence encoding a 
cellulase, interleukine, insulin like growth factor, interferon, lymphokine, human growth factor, nerve growth 
factor, aprotinin, insulin, hirudin, hormones, blood clotting factors, hepatitis B surface or core antigens, viral 
or bacterial vaccines or human granulocyte/macrophage colony stimulating factor. 

13. The yeast cell according to any of claims 1 to 12, 

45 characterized in that . . _ d^wo 

said fourth DNA sequence originates from a yeast of a member of the genera Saccharomyces, Pichja, 

Hansenula, preferably Schwanniomyces . 

14. The yeast cell according to claim 13, 

so sriHo^ corresponds to part or ail of the terminator of the Schwanniomyces a-amylase 

gene or part or all of the terminator of the Schwanniomyces glucoamylase gene. 

15. The yeast cell according to claim 14, 

SHeS by nucleotides 1537 to 1740 of the Schwanniomyces a-amylas gene or 

55 nucleotides 2875 to 3320 of the Schwanniomyces glucoamylase gene. 

16. The yeast cell according to any of claims 1 to 15, 

SS^lSr'iiS and/or third and/or fourth DNA sequence has been modified by insertion or deletion 
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or substitution of one or more nucleotides while the biological activity of the DNA sequence is retained or 
improved. 

17. The yeast eel! according to any of claims 1 to 16, 
characterized In that 

5 said expression cassette is carried by a circular or linear vector, which optionally comprises one or more 
selective marker genes. 

18. The yeast cell according to claim 17, 
characterized in that 

said vector comprises any of the following selective marker genes : 
10 TRP5, LEU2, ADE1, ADE2, HIS3, HIS4, URA3, LYS2, preferably obtained from Saccharomyces , Hansenula 
Pichia or Schwanniomyces , AMY1 or GAM1, preferably obtained from Schwanniomyces . 

19. The yeast cell according to any of claims 17 and 18, 
characterized In that 

said vector further comprises a DNA sequence capable of controlling autonomous replication and stable 
15 maintenance of the vector in the host organism. 

20. The yeast cell according to claim 19 
characterized In that 

said DNA sequence is derived from the genome of a member of the genera Saccharomyces , Fichia . 
Hansenula or Kluyveromyces , preferably Schwanniomyces . 
20 21 . A yeast cell according to any of claims 19 or 20, 
characterized in that 

said DNA sequence is selected from the autonomously replicating sequences ARS, preferably an ARS 
derived from the respective host organism. 

22. The yeast cell according to claim 21, 
25 characterized in that 

said DNA sequence is SwARSI or SwARS2, capable of controlling autonomous replication and stable 
maintenance of the vector in Schwanniomyces . 

23. The yeast cell according to claim 22, 
characterized in that 

30 said DNA sequences are comprised on a Clal-BamHI fragment (SwARSI) or a EcoRi-BamHI-fragment 
(SwARS2) of Schwanniomyces genomic DNA. 

24. The yeast cell according to any of claims 19 to 23, 
characterized In that 

said DNA sequence has been modified by deletion, insertion or substitution of one or more nucleotides, 
35 while retaining or improving its biological function. 

25. The yeast cell according to any of claims 1 to 24, 
characterized in that 

said yeast cell comprises any DNA-sequence homologous to genomic Schwanniomyces DNA, preferably 
homologous DNA-sequences flanking the expression cassette. 
40 26. The yeast cell according to any of claims 1 to 25. 
characterized in that 

said expression cassette is integrated into the genome, optionally in multiple copies, preferably via 
homologous recombination. 

27. The yeast cell according to any of claims 1 to 26 f 
45 characterized in that 

said yeast cell is derived from the genus Schwanniomyces and is capable of glycosylate. 

28. A process for the production of a polypeptide, wherein a yeast host organism is cultivated under 
suitable conditions and the polypeptide is recovered in a manner known per se, 

characterized in that 
so said yeast host organism is a yeast cell according to any of claims 1 to 27. 

29. The process according to claim 28 

characterized in that . 
yeast cells of the species Schwanniomyces occidentalis are used as a host organism and are preferably 
cultivated on starch.dextrin, maltose and/or plant biomass. 
55 30. The process according to any of claims 28 or 29, 
characterized In that 

yeast cells of the genus Schwanniomyces are used for the production of single cell protein. 
31 . The process according to any of claims 28 to 30, 
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characterized In that 

said polyp ptide is a-amylase or giucoamyiase. 

32. The process according to claim 28, 
characterized In that - 
the signal sequence of a yeasta-amylase or giucoamyiase gene is used for the production and secretion of 
foreign protein in yeast of the genera Saccharomyces , Kluyveromyces , Hansenula t Pichia or Schizosac- 
charomyces . 

33. Use of a yeast ceil according to any of claims 1 to 27 for transformation and expression of foreign 

genes. 
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Description 

The present invention relates to yeast ceils of the genus Schwannjqmyces and a process for the production of 
polypeptides. 

For expression of eukaryotic proteins of biotechnogical relevance there exists a variety of host systems, which are 
selected in compliance with the specific requirements. The most frequently used host organism is Eschejichia coli, 
being the best known representative of prokaryotic host organisms; Saccharomy_ces cereyjsjae and animal tissue cul- 
tures are used as eukaryotic host cells. For certain purposes, e.g. where expression of a glycoprotein is desired, 
expression in Exolj or other prokaryotic organisms is not appropriate, since same lack any glycosylation capability. 
On the other hand, the mass production of protein in tissue cultures still is cumbersome and expensive; therefore, the 
organism of choice in many instances is yeast of the well characterized genus Saccharamyces. cereyisiae is able 
to secrete efficiently small proteins, e.g. polypeptides like MG1 (Kurjan and Herskowitz, 1982), ^-endorphin or ct-inter- 
feron (Bitter et al„ 1 984), however proteins of higher molecular weight are captured in the periplasmic space (Emr et 
al, 1981) or the cytoplasma, thus causing decreased growth rates or even death of the cell. 

Therefore it is an object of the present invention to provide an improved eukaryotic expression system, which is 
optionally capable of secreting proteins encoded by a foreign DNA into the culture medium. 

This object has been solved by a yeast cell of the genus Schwanniqmy_c_es, wherein said yeast cell contains at 
least one expression cassette comprising : 

a) a first DNA sequence serving as a regulon, 

b) optionally a second DNA sequence coding for a signal peptide 

c) a third DNA sequence coding for a foreign protein, and 

d) optionally a fourth DNA sequence serving as a terminator, wherein the first, second and fourth DNA sequence 
are derived from yeast. 

The invention, which comprises further subjects, is now described in a more detailed manner by the following 
description, examples and figures. 

BRIEF DESCRIPTION OF THE FIGURES 

Fig.1 

A: Restriction map of the AMY1 gene encoding a-amylase and its 5' and 3' noncoding region 

B: Nucleotide sequence of the AMY1 gene and its flanking region. In the 5' region the transcription initation 
site is indicated by a dashed line above the sequence. Within the structural gene the presumptive signal pepti- 
dase cleavage sites are indicated by star-symbols. The two potential, glycosylation sites, the cystein residues 
presumably forming four disulfide bonds (concluded by homology to the A. oryzae enzyme) are underlined. 
Within the 3' end region the transcription termination signal TAG . . . TATGT . TTT is indicated by a solid line. 

Fig.2 

A: Restriction map of the GAM1 gene encoding glucoamylase and its 5' and 3' noncoding region 

B: Nucleotide sequence of the GAM1 gene and its flanking region. In the 5' region the transcription initiation 
site is indicated by a dashed line. Within the structural gene the presumptive signal peptides cleavage sites 
are indicated by star-symbols. 

Fig.3 

M13 Xhol. The EcoRI-Scal fragment, containing a 1.8 kb promoter fragment and approx. 0.4 kb structural 
gene fragment of the a-amylase gene, was subcloned into M1 3mp8 previously digested with EcoRI and Smal. 
By site directed mutagenesis a Xhol-site was inserted just in front of the ATG codon: 
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* * * 

5 ' . . TAAAATAAAA6CTCGAG A JG^AG A TJ_T t . ^ • 

Positions where bases have been exchanged by site directed mutagenesis, are indicated by asterisks. 
Fig.4 

pMaGAM-B/S. The Bglll fragment containing the glucoamylase gene and its 5' and 3' flanking region was sub- 
cloned into the BamHI site of vector pM AC5-8 (Kramer et al. , 1 984). By site directed mutagenesis using the gapped 
duplex DNA methode (Kramer et al., 1 984) a BamHI site and a Sail site were inserted just in front of the translation 
initiation codon ATG: 



* ****** ** 

5 ' . . CTCATGACTGTGTCGACGGATCCAAG AJ1&&I JiLZ* j • 

Positions affected by the site specific mutagenesis are indicated by asterisks. 
Fig. 5 

YRp7aGAM. The GAM gene was isolated from pMaGAM-B/S as a Sail fragment and ligated into the Xhoi site of 
M13aXho (Fig. 3). The Bglll fragment harbouring the a-amylase promoter GAM gene fusion was ligated into the 
single BamHI site of the S. cereyisiae vector YRp7 resulting in plasmid YRp7aGAM. 

Fig.6 

Construction of S^gccideji talis vectors for expression of foreign genes using the promoter and the signal sequence 
of the glucoamylase gene. 

a: Insertion of the 4.0 kb EcoRI-Pvull fragment from pBRSwARSGAM, containing the complete GAM promoter 
and the first 208 bp of the GAM encoding sequence into pRR322, cleaved with EcoRI/Pvull, resulting in vector 
pBRGAM. 

b: Insertion of the 3.4 kb Pvull fragment from plasmid pCT603, containing the structural gene coding for cel- 
lulase from base pair position +112, into the Pvull site of plasmid pBRGAM, resulting in vector pBRGCI. 

c: Insertion of the 5.5 kb Bglll-Pstl fragment from plasmid pBRGCI into BamHI/Pstl cut pCJD5-1 . The resulting 
plasmid, pMPGCI -1 contains 321 bp of the GAM 5* non coding region and the first 208 bp of the GAM coding 
region fused to position +112 of the celD gene. 

d: Insertion of the 6.5 kb BamHI-Pstl fragment from plasmid pBRGCI into BamHI/Pstl cut pCJD5-1 . The re- 
sulting plasmid, pMPGCI -2 contains 1 .3 kb of the GAM 5' non coding region and the first 208 bp of the GAM 
coding region fused to position +112 of the celD. 

Fig. 7 

Promoter studies using the qualitative endoglucanase D activity assay (congo red staining) with different trans- 
formants: NGA58: pMPAGC, NGA58: pMPPGC, NGA58: pMPGCI -1, NGA58: pMPGCI -2 

a: Growth on 2% maltose 
b: Growth on 4% glucose 

Fig.8 

Construction of a S. occjdejitalis vector for expression of foreign genes under control of the ADH1 promoter. 

a: Insertion of the 1.45 kb BamHI-Sall ADH1 promoter fragment of plasmid pFM2-1 intopCJD5-1, previously 
cut with BamHI/Sall resulting in plasmid pMPADHL 

b: Insertion of the 3.2 kb Sail fragment of pJDcg-15 into the Sail site of pMPADHI, resulting in pMPAG. 
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c: Exchange of the GAM promoter in plasmid pMPGCI -2 (Fig 6d) for the ADH1 promoter from plasmid pMPAG 
by in vivo recombination, resulting in plasmid pMPAGC. 

Fig. 9 

Construction of a S. occidental is vector for the expression of foreign genes under control of the PDC promoter. 

a: Insertion of the 6.0 kb Sphl-Aval fragment from plasmid pJDcg-15, containing the PDC promoter fused to 
the GAM gene into YRpJD2 previously cleaved with Sphl-Aval, resulting in plasmid pMPPG. 

b: Exchange of the GAM promoter in plasmid pMPGCI -2 (Fig. 6d) for the PDC promoter from plasmid pMPPG 
by jn_ vivo recombination, resulting in plasmid pMPPGC. 

Fig. 10 

Construction of an autonomous replicating vector (containing SwARS2) for transformation of S. occjde_ntalis. 
The 3.2 kb TRP5 BamHI fragment from plasmid YRpJD2 was ligated with the 5.5 kb BamHI-Bglll fragment of 
plasmid pBRSwARSGAM (Fig 6a), resulting in pMPTS2-A and pMPTS2-B. 

ABBREVIATIONS USED THROUGHOUT THE APPLICATION 

Restriction endonucleases : 



Aa 


-Aatll 


A 


- Asp718 


B 


-BamHI 


Bg 


-Bglll 


C 


-Clal 


E 


- EcoRI 


EV 


- EcoRV 


H 


- Hindlll 


P 


-Pvull 


Ps 


-Pstl 


S 


-Sail 


Sc 


- Seal 


Sm 


-Smal 


Sp 


-Sphl 


X 


-Xhol 



Throughout this application various publications are referenced by mention of the first author and the year of 
publication within parentheses. Full citation of the references may be found at the end of the Specification as an annex, 
listed according to their alphabetical order immediately following the specification. The disclosures of these publications 
in their entireties are hereby incorporated by reference into this application in order to more fully describe the state of 
the art as known to those skilled therein as of the date of the invention described and claimed herein. 

The yeast cell according to the present invention is characterized by containing an expression cassette which in 
turn is composed of three to four different components, being arranged in operable linkage in order to ensure efficient 
expression of a desired polypeptide, optionally followed by secretion of same. These components are defined to be 
the following : 

a) A first DNA sequence serving as a regulon. The term "a regulon" as used throughout the application comprises 
any cis-acting DNA sequence involved in the regulation of expression of a given structural gene. The term thus 
embraces sequences preceeding a given coding sequence, for example, a promoter and sequences which are 
recognised by transcription factors or other proteins involved in transcription regulation. The "first DNA sequence" 
is thus not limited to sequences corresponding to known promoters. Also included are DNA sequences corre- 
sponding to natural occuring regulons, which have undergone insertion, deletion or substitution events, but still 
retain their activity as regulon. Further the term "regulon" includes DNA sequences involved in regulation of tran- 
scription, which have been synthesised chemically or obtained by methods of gene technology. 

b) An optional second DNA sequence coding for a signal peptide. The meaning of this second DNA sequence 
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includes various DNA sequences, directing the secretion of any polypeptide containing the thus encoded peptide 
at its N -terminal end. 

c) A third DNA sequence coding for a foreign protein. This coding sequence coding for the signal peptide in order 
to retain a proper reading frame for the third DNA sequence. The fusion thus requires application of methods or 
recombination DNA technology known to every person skilled in the art. For methods enabling to design proper 
fusions it is in general referred to Maniatis et al. (1982), wherein appropriate methods, e.g. various ways to treat 
restriction fragment ends, cloning of linker molecules and similar procedures are described in great detail. The 
foreign protein coded for by the third DNA sequences may be any protein which to express there exists some 
need. Since the secretion system is capable of secretion not only of small molecules, but also of large proteins, 
expression of any desired protein should be possible. For examples it is referred to the discussion below. 

d) Optionally a fourth DNA sequence serving as a terminator. As the terminator any sequence may be used, which 
efficiently terminates transcription in the respective host organism. Terminators may be for example sequences 
prone to formation of hairpin structures and/or polyadenylation sites. In a preferred embodiment the terminator is 
derived from the same gene as the second DNA sequence, i.e. the regulon, is derived from. 

The four components are combined with each other using known techniques. Optionally the proper fusion will be 
verified for example by sequencing the boundaries. 

The yeast cell according to the present invention contains at least one expression cassette comprising some or 
all of the above mentioned DNA sequences. It provides a convenient tool for the industrial expression of any desired 
protein. 

In a preferred embodiment said first and/or second and/or fourth DNA sequence are derived from a gene coding 
for an amylolytic enzyme, for example a-amylase or glucoamylase, which enzymes in most organisms are induced 
upon contact with starch. The DNA sequences may be derived from a yeast of a member of one of the genera Debar_- 
tomy_ce§, Saccharqrrjy_ces , Lippmyces, Pjchia, or Sacc h_a_rqm y_c_e s , but yeast of the genus Schwannjqmyces are pre- 
ferred. 

Yeast species of the genus Schwannjqmyces have an efficient expression and secretion apparatus, enabling them 
to grow on starch as a sole carbon source. Starch is hydrolysed to glucose by two amylolytic enzymes, secreted into 
the supernatant, namely a-amylase (a- 1 ,4-glucan 4-glucanohydrolase E.C.3.2. 1.1) and glucoamylase (syn. amyloglu- 
cosidase) (a-1,4-glucanglucohydrolase, E.C.3.2. 1. 3.; with debranching activity E.C.3.2. 1. 9.). The amylolytic enzymes 
of Schwannjqmy_ces_c^cidentaNs, formerly also called Sch wan njqmyc_es casteNij and Sc h wan njqmyce s _a |l uvt u s (Price 
et al., 1978^ are well documented (Oteng-Gyang et al., 1981; Sills et al., 1 982, 1984a, 1984b" Wilson et"al~"l 982). 

The genes for a-amylase and glucoamylase from a yeast of the genus Schwannjqmyces have been disclosed by 
the present inventors in EP-A-0260404. However, at that time the DNA or peptide sequences directing secretion have 
not been identified. Furthermore their use for the establishment of an efficient expression and secretion system in yeast 
was not known then. 

It is preferred, to use as a first DNA sequence part or all of a 1.8 kb Bglll--Xhol fragment which preceeds the 
structural gene coding for the Schwannjqmyces a-amylase, or part or all of a 1 .3 kb BamHI-Pvull fragment preceeding 
the structural gene coding for the Sc h wa n njqrri y_c_e s _qcc id e n ta lis glucoamylase. Further if only smaller fragments are 
to be used, it is preferred, to use a DNA sequence comprising part or all of the DNA sequence corresponding to bases 
-1 to -540 of the region preceeding the structural gene coding for a-amylase or part or all of the of DNA sequence 
corresponding to bases -1 to -320 of the region preceding the structural gene coding for glucoamylase. The respective 
DNA fragments are fully active as promoters, inducible by starch, dextrin, maltose. 

It is obvious, that replacement of bases or base pairs, not involved in the transcriptional regulation, still are within 
the spirits of the present invention. Thus natural mutations or synthetic equivalents also are embraced by the present 
invention. 

In yeast cells of the genus Schwannjqmyces it is also possible to use promoters derived from yeast genes obtained 
from many other yeast genera. Preferred are regulons involved in the expression of ADH1, ADH2, PDC1, GAL1/10, 
PGK and GAPDH, or LAC 4 which are obtained from Sa cc h_ajqm yc e s_ce rev isiae, K|uyyerpmyces Lactis or Schwann)-; 
pmyces qcc|d_entaji§. 

Furthermore, it is possible to use viral regulons and/or terminators, for example regulons and terminators obtained 
from Ecolj T-phages. The phages possess extraordinarily strong functional sequences for performance of their lytic 
functions. In a preferred embodiment sequences of phage T7 are used, which are regulated by expression of T7 RNA 
polymerase. 

In order to increase the over all expression level and furthermore, in order to facilitate purification of produced 
polypeptides, it is most desirable, that the polypeptides are secreted into the culture medium. The present invention 
provides signal sequences having the invaluable advantage of being derived from genes of amylolytic and thus indue- 
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ible enzymes, which upon induction have to be secreted very efficiently. A further advantage is, that induction of amy- 
lolytic enzymes does not require complicated media, lacking for example a specific amino acid or phosphate, but only 
require the culture to be transferred to a starch containing medium. Signal sequences, which are preferably used for 
secretion of large proteins include part or all of at least one of the following peptides: 



a) Met ArgPheSerThrGLuG LyPheThrSerLysValVal At aAlalLe 
LeuALaPheSerArgLeuValSerAta; 



b) MetArgPheSerThrGLuGlyPheThrSerLysVaLValALaALalle 
LeuAlaPheSerArgLeuValSerAlaGlnProHelLePheAspMetArg; 



c) Met I L ePheLeuLy sLeuU eLysSerl t eVal I L eG L yLeuG LyLeu 
VaLSerAla; 



d) Met I lePheLeuLysLeuILeLysSerlleVat I LeGlyLeuG LyLeu 
ValSerALalLeGLnAlaALaProALa; 



e) Met HePheLeuLysLeuILeLysSerHeVal ILeG LyLeuG LyLeu 
ValSerAlalleGLnAl a Al aProA L aSe r Ser I L eG I y Se rSe r A I aSe r AL a 



In a further embodiment the above mentioned peptides are coded for by DNA sequences, which correspond to all 
or part of the following DNA sequences: 



a) ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATTTT 
AGCATTCTCAAGATTGGTATCTGCT; 



b) ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATTT 
TAGCATTCTCAAGATT6GTATCTGCTCAACCGATTATTTTTGACATGAGA; 



c) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCT; 
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d) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTT6GGATTA 
GTTAGTGCTATCCAAGCAGCCCCTGCC; 

e) ATGATTTTTCTGAAG CTGATTAAAAGTATAGTAATTGGTTTGGG ATTA 
GTTAGTGCTATCCAAGCAGCCCCTGCCTCTTCGATTGGATCTAGTGCTTCA 
GCA. 

It will be understood, that the replacement of amino acids by residues having comparable properties and use of 
alternative codons in compliance with the possibilities provided by the genetic code will not have any impact on the 
yeast cell according to the present invention. 

The polypeptide to be expressed in a yeast cell according to the present invention is coded for by a third DNA 
sequence. Said third DNA sequence may comprise natural or synthetic DNA, even DNA containing intervening se- 
quences. Examples for proteins being expressed by the yeast cell according to the present invention comprise proteins 
of enormous importance for scientific or medical purposes, for example cellulase, interleukine, interferon, insulin like 
growth factor, lymphokine, human growth factor, nerve growth factor, aprotinin, insulin, hirudin, hormones, blood clotting 
factors, hepatitis B surface or core antigens, viral or bacterial vaccines or human granulocyte/macrophage colony 
stimulating factor. 

The yeast cell according to the present invention may optionally comprise a terminator as component of said 
expression cassette. Preferred terminators are derived from genes of one of the following genera : 

Saccharqmyces, Pjchja, Hansenuja, but preferably Schwannjqmyce. Examples of preferred terminators are those 
derived from S ch wan njqmyc es a-amylase gene or Schwannjqmyces glucoamylase gene, part or all of which my be 
used. Such terminators are comprised by nucleotides 1537 to 1740 of the Schwanniqm^ces a-amylase gene or nu- 
cleotides 2875 to 3320 of the Schwa wijqmyces glucoamylase gene. 

It is obvious, that in case of any of the DNA sequences mentioned modifications, which do not impair the biological 
function of the respective DNA fragment still are covered by the present application. There are numerous possibilities 
to change the DNA sequences mentioned above slightly without impairing the biological function of said fragment gene. 

The yeast cell according to the present invention, carrying an expression cassette composed of the above dis- 
cussed components, may be carried by a circular or linear vector, which optionally comprises one or more selective 
marker genes. The present inventors demonstrated, that a lot of marker genes so far used for transformation of Sac : 
charpmyces, for example TRP5, LEU2, ADE1 , ADE2, HIS3, H1S4, URA3, LYS2, which may be obtained from Saccha_- 
rqmyces, Hansenuja, Pjchia or Schwannjqmyces, are functional in Sch wan nj qm y_c_es joccid e nta li s. Furthermore it is 
also possible to use the amylolytic properties of a-amylase or glucoamylase in order to select transformed organisms. 
The corresponding genes have been isolated and characterized. AMY1 and GAM1 are preferably obtained from 
Schwannjqmyces. 

The yeast cell according to the present invention may contain the expression cassette as an insert into one of the 
endogenous yeast chromosomes. Furthermore, there is however the possibility, that a DNA sequence capable of con- 
trolling autonomous replication and even also stable maintenance of the vector in the host organism, is introduced into 
the yeast cell. Said DNA sequences are the so called ARS sequences, capable of controlling autonomous replication 
and stable maintenance in the respective host organism. Said DNA sequences are preferably derived from Sacchaj 
rqmyces, Pich[a_ Hansen u la or KJuyy e r p my c e s , most preferred from Sch wan njqmy_c_es . 

In order to ensure effective replication in the prospective host organism, it is preferred to use an ARS, derived from 
the respective host organism. 

The present invention provides one new SwARS sequence. The DNA sequence SwARS2 is capable of controlling 
autonomous replication and stable maintenance of the vector in a Schwannjqmyces yeast cell. The detailed sequence 
of SwARS has not yet been determined. SwARS2 is localised in the Schwannjqmyces genome in the 3' region of the 
glucoamylase gene, as shown for example in Fig. 10. SwARS2 is easy to identify and efficiently allows autonomous 
replication of any plasmid, exhibiting this sequence. The so far known Schwannjqmyces autonomously replicating 
sequences, SwARSI and SwARS2 are comprised on a Clal-BamHI fragment (SwARSI) adjacent to the EcoRI-frag- 
ment shown in Fig 2a (see Fig. 10) and/or an EcoRI-Bglll fragment contained in the EcoRI fragment shown in Fig 2a 
(SwARS2). 

For the SwARS sequences it is also not necessary, in any case to use the sequence as it occurs naturally. DNA 
sequences, that have been obtained by any kind of modifying treatment, for example by deletion, insertion or substi- 
tution of one or more nucleotides or base pairs, are still within the scope of the present invention, provided the modi- 
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fications retain or improve the biological function of the yeast cell according to the present invention. 

In a preferred embodiment the yeast cell according to the present invention comprises a DNA sequence, which is 
homologous to genomic Sch wan njqmyces DNA. Provision of structures of homologous DNA enable the DNA fragment 
flanked by such homologous sequences to be inserted by homologous recombination. It is thus preferred to have 
s homologous sequences, which do not represent any essential gene of the Schwannjqmyces genomic DNA, but rather 
any gene, whose expression product may be supplemented by additives to the respective medium or were lack of the 
respective expression product does not have any serious impact on the transformed yeast cell. 

If no SwARS sequence or any equivalent sequence, for example originating from any other yeast species, is 
available, the expression cassette can be used without an autonomously replicating agent. In this case the expression 
10 cassette tends to integrate into the genome, for example via homologous recombination. Preferably there are several 
copies inserted into one or more chromosomes. 

In this case it is preferred, to have multiple copies of the expression cassette inserted, in order to ensure simulta- 
neous transcription and translation of several copies. 

When the yeast cell according to the present invention is derived from a yeast of the genus Schwannjqmyces the 
1$ polypeptide products observed following expression of the respective foreign gene in said yeast cells exhibit glyco- 
sylation patterns, which closely resemble or even match those of the desired eucaryotic protein. Yeast cells of the 
genus Schwannjqmyces are thus preferred for expression of eucaryotic proteins. 

The above discussed yeast cell according to the present invention can be used in a process for the production of 
a polypeptide, wherein a yeast host organism is cultivated under suitable conditions and the polypeptide is recovered 
20 in a manner known per se. The conditions of cultivation depend on the respective host organism used. Examples for 
cultivation conditions for yeast of the genus Schwannjqmyces are provided in the examples below. The polypeptide, 
produced by the yeast according to the present invention, is recovered by conventional purification methods, which 
depend on the properties of the expressed protein, the availability of specific absorbing agents and so on. 

In a preferred embodiment yeast cells of the species Schwannjqmyces occidentaljs are used as a host organism 
25 and are cultivated on starch .dextrin, maltose and/or plant biomass as the respective carbon source. As commonly 
known, these carbon sources are available at low costs. 

Furthermore, there is a possibility to use cells of the genus Schwannjqmyces for the production of single cell 
protein. The single cell protein may then be subjected to any further use as it is required. 

In a preferred embodiment the polypeptide produced in a process according to the present invention is an amylolytic 
30 enzyme, for example an a-amylase or glucoamylase. These enzymes are commercially used to debranch and degrade 
starch, which thereupon may be further processed, for example in brewery or baking processes. There is a high need 
for the provision of starch degrading enzymes. 

In a further embodiment of the process according to the present invention production of a polypeptide may be 
achieved by providing expression cassettes as discussed above containing a yeast a-amylase or glucoamylase signal 
35 sequence and inserting the respective expression cassette into a yeast of one of the genera Saccha^qmyces, Kkjy: 
yeromyces, Hansen u]a p Pichia or Sch i zos acc h a rorny ces . Production and secretion of foreign proteins in the respective 
yeast host organism is obtained by applying the appropriate inductive measure, whereupon the produced protein is 
secreted due to the presence of the yeast a-amylase or glucoamylase signal sequence. Since it is known from exper- 
iments performed by the present inventor, that these signal sequences are recognised in virtually any yeast genus, 
40 the a-amylase or glucoamylase signal sequences may be applied in many different yeast with success. Also in this 
case, it is preferred, to use the a-amylase or glucoamylase signal sequences derived from the respective genes of 
Schwannjqmyces _occidentalis. 

In summary the yeast cejls according to the present invention are well suited for transformation and expression 
of any desired foreign gene . As already outlined above, expression of procaryotic as well as eucaryotic genes is 
45 possible. However, for expression of eucaryotic proteins yeast celts of the genus Schwamjqmyces containing some 
or all of the DNA sequences mentioned above are the most preferred yeast cells. 

PJlAJiACTE_RISAJION OF.THE a-AMYLASE AND GLUCOAMYLASE GENE EXPRESSION IN DIFFERENT YEAST 
G~ENER A ~. 

so 

The N-termini of the deduced amino acid sequences of the a-amylase gene {Fig. 1 B) and glucoamylase gene (Fig. 
2B) exhibit the features of typical leader sequences of secretory proteins (Von Heijne 1983, Perlman and Halvorsson 
1983). The N-terminal amino acids of mature a-amylase secreted by S^ occidejitalis are determined to be AspValser. 
This indicates processing between Arg33 and Asp31 which is unusual and not according to the Von Heijnes model of 
55 site specificity for signal peptidase cleavage. This putative cleavage site was confirmed also for a-amylase expressed 
in S. cereyisiae, Pich[a stjpjdis, K. lactis, S. pombe and K polymorpjia. However, in these yeast species processing 
is also found to occur between Ala 25 and Gin 26, resulting in the N-terminal sequence GlnProlle. 

a-amylase isolated from a culture of S. occidejitalis migrates as a single band of 55 kDa on SDS polyacrylamide 
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gels: following Endoglucosidase H treatment and subsequent gel electrophoreses a single band of 54 kDa is observed, 
indicating that only one of the two potential N-glycosylation sites of the protein is used for glycosylation. It is noteworthy 
that high mannose glycosylation, as very often found by proteins secreted in Sj.ee/evisiae is not accompanying secre- 
tion in S. occidental is. 

5 

IDENTIFICATION OF THE TRANSCRIPTION INITATION SITES AND PROMOTER FUNCTION OF THE a-AMYLASE 
AND GLUCOAMYLASE GENE. 

By S1 mapping of polyA-mRNA (Sharp et al., 1980) the transcription initiation site of the ct-amylase gene could 
10 be localised to be in the region of -42 to -30 (indicated by an interrupted line above the relevant sequence in Fig.1 B) 
and the transcription initiation site of the glucoamylase gene was determined to be in the region -7 to -10 (underlined 
in Fig.28). 

CONSTRUCTION OF AN a-AMYLASE - GLUCOAMYLASE HYBRID GENE 

15 

In order to facilitate use of the a-amylase promoter and glucoamylase structural gene in further constructions 
additional restriction sites were inserted in front of the translation initiation codon of both genes using oligonucleotide 
directed mutagenesis as described in the AMERSHAM protocol. Correct mutagenesis was verified by sequencing 
according to Sanger (1 977). The boundaries between the original Schwannjqmyces genes and the inserted polylinker 
20 are shown below : 



OC-araylase : 

25 

** * 

5 1 . . TAAAATAAAAGCTCGAGATG_A_GAJ_T.T._. .3 1 . 
_X_h_oI 

30 

G I u co amy I a se : 

35 

* ★**•**★ 

5 1 . . CTC ATGACTGTGTCG ACGGATCC AAG ATG.ATTJJit. .jj^ • 
_SaJJ,„ BamHJ 

40 

The manipulations resulted in plasmids M13aXho (Fig.3) and pMaGAM-B/S (Fig.4). 

Thus, both the a-amylase promoter and the structural glucoamylase gene can be isolated from plasmid M1 3a Xho 
and pMaGAM-B/S as suitable DN A fragments. The GAM gene was isolated from pMaGam B/S as a 3.2 kb Sail fragment 
45 and ligated into the Xhoi site of M13aho. The Bglll fragment harbouring the a-amylase promoter/GAM gene fusion 
was ligated into the single BamHI site of the S. cereyisiae vector YRp7 (Struhl et al., 1979) resulting in plasmid 
YRp7aGAM (Fig 5). S. cereyisiae strain YNN27 was transformed with this plasmid selecting for TRP prototrophy. 
Transformants secreted active glucoamylase into the culture supernatant (50 mU/ml) indicating that the a-amylase 
promoter fragment can direct expression and secretion of foreign proteins. 

so 

SECRETION OF a-AMYLASE AND GLUCOAMYLASE IN DIFFERENT YEASTS USING SUITABLE PROMOTERS. 

Expression of the a-amylase and glucoamylase gene and secretion of the gene products can be detected in 
Jactis under control of the promoter of the a-galactosidase (LAC4) gene (Breunig et al., 1984). Expression of the a- 
55 amylase and glucoamylase gene and secretion of the both proteins can also be detected in S^ pombe under control 
of the ADH1 promoter (Russel et al., 1983) or of the GAL1/10 promoters (Johnston and Davis 1984). 
After fusion with the GAL1/10 promoter (Johnston and Davis 1984), DHAS promoter (Janowicz et al., 1985), MOX 
promoter (Ledeboer et al., 1 985) or FMDH promoter (EP 87 110 417.0) expression of the a-amylase and glucoamylase 
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gene and secretion of both proteins can be detected in H_ansenu]a gqlymorpha. 

EXPRESSION AND SECRETION STUDIES IN S^OCCJDENTAUS USING THE CELLULASE ENDOGLUCANASE D 
(EGD)OF CLOSTRIDIUM JHERMOCELLUM AS A MODEL FOR FOREIGN GENE EXPRESSION. 

5 

As a first example for the expression of a foreign gene in _S_. occidental is, the celD gene from C. ]hejmocellum 
(Millet et al., 1985, Joliff et al., 1986b), which codes for a thermostable cellulase, namely an endoglucanase (EGD) 
was used. Advantages in using this system are : 

jo - specific reaction of EGD, which can easily be monitored 

a rapid qualitative assay for colonies with EGD activity (see material and methods) 
availability of antibodies against EGD for specific detection of translation products. 

CHARACTERISATION OF THE GLUCOAMYLASE PROMOTER AFTER FUSION WITH THE celD GENE 

15 

Plasmids were constructed containing a replicon from S. occjdentajis (SwARSI ) (EP 871 1 0370. 1 ) the TRP5 gene 
from acerevisiae (Zalkin and Yanowski.1982) as a selective marker and, in addition, a GAM/celD gene fusion under 
the control of different promoters. In the first step the 4.0 kb EcoRI-Pvull fragment from plasmid pBRSwARSGAM (Fig. 
6a) was isolated and inserted into the 2296 bp pBR322 EcoRI-Pvull fragment, containing the amp gene and the bacterial 

20 origin, resulting in plasmid pBRPGAM (Fig.6a). In addition to the pBR322 sequence this plasmid carries 3.6 kb from 
the 5' non coding region of the GAM gene and the first 208 bp coding for the N-terminal part (including the signal 
sequence) of the glucoamylase. A 3.4 kb Pvull fragment from plasmid pCT603 (Joliff et al., 1 986) containing the coding 
region of the celD gene missing the 5' 1 1 1 bp was inserted into the Pvull single site of pBRPGAM resulting in pBRGCI 
(Fig.6b). EL cqlj transformed with this plasmid results in weak Carboxymethylcellulase (CMCase) activity, analysed by 

25 congo red staining (see material and methods) indicating that the glucoamylase promoter is slightly expressed in E. 
coli. For the construction of a occjdentajis expression vector the plasmid pCJD5-1 (EP 87110370.1) was cleaved 
with BamHI/Pstl and ligated with" the 5.5 kb Bglll-Pstl fragment or 6.5 kb BamHI-Pstl fragment from pBRGCI , resulting 
in pMPGCI -1 (Fig.6c) and pMPGCI -2 (Fig.6d), respectively. Both plasmids share an in frame fusion of GAM/celD but 
differ in the length of the 5' upstream region of the GAM gene. S. occjdentajis strain NGA58 (trp5 ade) isolated after 

30 UV mutagenesis of strain NGA23 was transformed with these plasmids and transformants selected for tryptophan 
prototrophy and analyzed by means of congo red assay for cellulase activity. Transformants produced active EGD 
when grown under inducing conditions in 2% maltose (Fig 7a). By analysis with the congo red assay no activity could 
be detected under repressed conditions in 4% glucose (Fig. 7b). NGA58:pMPGC1-1 and NGA58:pMPGC1-2 trans- 
formants secrete active EGD into the culture supernatant when grown in maltose under induced conditions. The intact 

35 or complete glucoamylase promoter resides on plasmid pMPGCI -2 as shown by 20 times higher activity compared to 
the shorter promoter fragment in gene fusion of pMPGCM (table 1). However the 0.3 kb promoter fragment still 
harbours regulatory units which lead to catabolite repression by glucose (Fig 7b). Comparison of the N-terminal GAM 
sequence to the signal peptidase cleavage sites of several other secretory proteins (\fcn Heijne 1983) reveals three 
potential signal peptidase cleavage sites after the amino acid residues Ala 19, Ala 25 and Ala 34. For determination 

40 of the N-terminal amino acid of the mature glucoamylase the protein was isolated for N-terminal amino acid sequencing. 
However the N -terminus is blocked and therefore could not be sequenced. To confirm the correct cleavage site of the 
GAM signal sequence in S^ occidentals, the regions between the potential signal peptidase cleavage sites and the 
Pvull gene fusion site are deleted in the GAM/celD fusion by in vitro mutagenesis to study the influence on secretion 
of active EGD. 

45 

STUDIES OF THE USE OF SUITABLE &CEREVISIAE PROMOTERS IN THE EXPRESSION OF FOREIGN GENES 
IN S. OCCJ DENTAL IS. 

The ADH1 promoter (Hitzeman etal., 1981) was isolated as a 1.45 kb BamHI-Sall fragment from plasmid pFM2-1 
so (MulleretaL, 1987) and ligated into plasmid pC J D5-1 (EP87 110370.1) after cleavage with Bam HI and Sail resulting 
in pMPADHI (Fig.Sa). The GAM gene was excised as a Sail fragment from pJDcg-15 (see below) and inserted into 
the single Sail site of pMPADHI (Fig. 8b) resulting in plasmid pMPAG (Fig. 8b). By the means of |n vivo recombination 
(Ma et al., 1 987) the GAM promoter from pMPGCI -2 (Fig 6d.) was replaced by the ADH1 promoter of pMPAG leading 
to plasmid pMPAGC (Fig. 8c). Plasmid pMPGC1-2 was linearized with BamHI; the unique BamHI site of pMPGCI -2 
ss is localized between the TRP5 coding sequence and the glucoamylase promoter. The 2.75 kb Clal fragment of pMPAG 
containing the ADH1 promoter shares homologies to the TRP5 gene as well as to the GAM coding region up to position 
+208. For jn vivo recombination NGA58 was transformed with BamHI linearized pMPGC1-2 together with approx. 10 
fold excess of Clal fragment using the transformation protocol of Ito et al., 1983. 
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Recombinants were screened for their ability to hydrolyze CMC on 4% glucose. Under these conditions celD ex- 
pression is repressed in transformants harbouring ^circularized pMPGC1-2. From 146 transtormants 7 colonies ex- 
pressing cellulase in the presence of 4% glucose were isolated. NGA58 containing the "jn vivo" constructed plasmid 
pMPAGC expresses active EGD constitutively (see Fig. 7). This result demonstrates that the newly constructed plasmid 
5 pMPADHI serves as a vector for foreign gene expression. 

In another experiment a recombinant plasmid was constructed by m_ vivo recombination to study the expression 
of the GAM/celD gene fusion under control of the PDC promoter (Das and Holienberg, 1982). Therefore the 6.0 kb 
Sphl-Aval fragment from plasmid pJDcg-15 (EP 87 110370.1) carrying the glucoamylase gene fused to the PDC pro- 
moter was subcloned into the respective restriction sites of YRpJD2 (EP 87 110370.1) resulting in plasmid pMPPG 
10 (Fig. 9a). 

For jn_ vivo recombination pMPPG was cleaved with Clal and used together with BamHI linearized pMPGC1-2 for 
transformation of NGA58. Homologous recombination results in plasmid pMPPGC (Fig. 9b) harbouring a GAM-celD 
fusion under control of the PDC promoter. This construction leads to an expression in Schwannjqmycjis^qccidentalis, 
NGA58 inducible by glucose and partly repressable by maltose (Table 1). Transformants were analysed for expression 

is of active EGD using the qualitative congo red assay (Fig. 7). The results indicate that the 0.3 kb GAM promoter fragment 
leads to regulated expression, however less efficient that the 1 .3 kb fragment (for comparison see table 1 and Fig. 2A). 
Expression under the ADH1 promoter leads to efficient constitutive expression. The expression level is comparable 
with that of induced expression directed by the 1 .3 kb GAM promoter. 

The fact that plasmids can be constructed by vivo recombination is a proof for homologous recombination and 

20 hence offers the possibility for stable site directed integration into the genome of S. occjdejitalis, a prerequisite for the 
industrial production of foreign proteins. 

The expression of the a-amylase gene and glucoamylase gene and secretion of the both gene products in S. 
occidental could also be achieved under control of the phosphoglycerate kinase (PGK) promoter (Dobson et al., 
1982), glyceraldehyde-3-phosphate dehydrogenase (GAPDH) promoter (Holland and Holland 1979), copperchelatin 

25 (CUP1 ) promoter (Karinetal., 1984, Buttetal., 1984)andalcoholdehydrogenase2(ADR2or ADH2) promoter (Russell 
et al., 1983, Beieretal., 1985) or galactokinase (GAL1/10) promoter (Johnston and Davis, 1984). 

Construction of pJDcg-15 : 

30 The GAM gene was isolated as a 3.2 kb BamHI-Sall DNA fragment from plasmid pMaGAM B/S (Fig.4) and inserted 

into the respective restriction sites of plasmid pBM272 Johnston & Davis, 1 984). The PDC promoter was isolated from 
plasmid pCP202 (Kellermann & Holienberg, 1 988) as a Sall-Sphl fragment and tigated into the respective single sites 
of vector YRp7 (Struhl et al, 1979). To be able to isolate the PDC1 promoter as a BamHI fragment the 3.2 kb Sail 
fragment of PMaGAM B/S containing the GAM gene was fused to the PDC promoter via the single Sail site. The 

35 resulting BamHI fragment containing the PDC-promoter was ligated into the single BamHI site of pBM272 containing 
the GAM gene. The final plasmid is called pJDcg-1 5 (Fig. 8b). 

EXPRESSION OF HUMAN GRANULOCYTE/MACROPHAGE COLONY STIMULATING FACTOR (hGM-CSF) 

40 The gene coding for hGM-CSF (Cantrell et al., 1985) was obtained from British Biotechnology Ltd. as a Hindlll- 

EcoRI fragment. Position +52 of hGM-CSF gene was fused to position + 102 of the GAM signal sequence under control 
of the ADH1 or GAM promoter. This fusion results in a hybrid protein in which the endopeptidase cleavage site of the 
GAM signal sequence is linked with a Met included in the synthetic gene offered by British Biotechnology Ltd., followed 
by the first amino acid of mature hGM-CSF (Ala 18). This fusion was inserted into a S. pccidentalis expression vector 

45 and transformed into NGA58, Western blot analysis revealed secretion of hGM-CSF protein. 

EXPRESSION OF INTERLEUKINE-2 

The gene coding for IL-2 (Taniguchi et al., 1983) was obtained from British Biotechnology Ltd., as a EcoRI-Hindlll 
so fragment. Position +61 of IL-2 gene was fused to position + 1 02 of the GAM signal sequence under control of the ADH 1 
or GAM promoter. This fusion results in a hybrid protein in which the endopeptidase cleavage site of the GAM signal 
sequence is linked with a Met included in the synthetic gene offered by British Biotechnology Ltd., followed by the first 
amino acid of mature IL-2 (Ala 21). This fusion was inserted into a S. pccidentalis expression vector and transformed 
into NGA58. Western blot analysis revealed secretion of IL-2 protein. 

55 

ISOLATION OF AN ADDITIONAL SwARS ELEMENT AND A NEW SELECTABLE MARKER 

A new SwARS sequence (SwARS2), localized downstream of the coding region of the GAM gene, could be iden- 
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tified. To analyze the function of SwARS2 a vector was constructed containing the Tips gene and SwARS2 in pBR322, 
resulting in pMPTS2-A and pMPTS2-B, respectively (Fig. 10). For the construction of pMPTS2-A and pMPTS2-B the 
5.5 kb BamHI - Bgll fragment of pBRSwARSGAM, constructed by insertion of the EcoRI -fragment shown in Fig.2a into 
the single EcoRI site of piasmid pBRSwARS1 p (disclosed in EP 87 110370.1), was isolated and ligated with 3.2 kb 
s BamHI TRP5 fragment obtained from YRpJD2. High frequency transformation of S. occjdejntalis is obtained using this 
piasmid. 

As an additional selectable marker the LEU2 gene of S. occidentalis was isolated. The LEU2 gene, coding for 
isopropyl malate dehydrogenase, was cloned by functional complementation of the mutant AH22 (his4leu2) (Hinnen 
et at., 1978) of cereyisiae by transformation with a cosmid genomic library from S. occjde_ntalis. 
10 The LEU2 gene was found to reside on a 9 kb EcoRI fragment of Schwannjqmyces genomic DNA; it's function 

could be abolished by insertion of the TRP5 gene of S. cereyisiae. The disrupted LEU2 gene was integrated into the 
genome of NGA58 by homologous recombination resulting in a new strain NGA581 (TRP5, LEU2, and ADE). An 
analogous experiment with the cloned HIS4 gene of S. occjdejitalis (EP 87 110 370.1 ) led to a strain NGA58h (TRP5, 
HIS4, ADE). 

15 

I MPROVED EXPRESSION OF HETEROLOGOUS GENES USI NG THE T7 RNA POLYMERASE AND T7 PROMOTER 
AND TERMINATER FOR THE EXPRESSION OF FOREIGN GENES IN S. OCCJDENTALIS 



T7 RNA polymerase (Fuerst et al., 1986) can be expressed in S. cccjdentalis after fusion of the structural gene 
20 (Dunn and Studier, 1984) using the BamHI restriction site near the translation initation codon with suitable promoters 
asa-amylase, GAM (0.3 kb fragment) or PDC and homologous integration into the genome via any homologous cloned 
gene, e.g. HIS4, LEU2, AMY1 and GAM. A suitable fragment, containing the fusion flanked by homologous cloned 
sequences can be isolated and used together with pCJD5-1 (Fig. 6c) for co-transformation of NGA58, selecting on 
YNB containing 0.5% casaminoacids. Colonies containing the integrated T7 RNA polymerase gene can be identified 
25 by Southern analysis after isolation of transformants mutated in the target gene. 

For the expression of foreign genes a S^pccidentalis expression vector was constructed containing SwARS2, the 
ColE replication origin of Exgli, a suitable selective marker for S. occjdentajis (e.g. TRPS, HIS4, LEU2) and a cassette 
containing the T7 polymerase promoter and terminator sequences isolated from piasmid pAR2529 (Fuerst et a!., 1986), 
separated by the polylinker sequence from M1 3mp1 9. To study the efficiency of the T7 system we expressed a GAM/ 
30 celD fusion (GAM signal sequence and structural celD gene obtained as a BamHI-Asp 718 fragment from pMPGCI -2) 
under control of the T7 promoter by insertion into the BamHI-Asp718 site of the polylinker sequence. Using this system 
active cellulase is secreted into the culture medium. 

MATERIALS AND_METHOpS 

35 

1 . The microorganisms used are as follows : 

o 

Schwannjomyces occidentalis NGA23 : (DSM 3792) 



40 Sc h wan nj omyc es_occ ide nta lis NGA58 



45 



so 



Kju^erpmyces lactjs SD11 : (DSM 3795) 

Schizosaccharpmyc^^ojnbe LEU 1-32, HIS5-303 (DSM 3796) 

Sacchajqmyces cerevisjae YNN27 (URA3-52, TRP1 -298, GAL2) 

The Eschejichia cpli strain used is as follows E. coli HB1 01 F-. hsd S20 (r B - m B -), recAJ 3, ara_-J 4, proA2, JacYJ , 
pal_K_2, rpVL20 \sr^)'x^\-_5 t mtM , sup_E44) 

2. Culture media : 



Schwannjqmyces .occidentals strain NGA58 (isolated by UV mutagenesis of strain NGA23) was grown at 30°C in 
YNB (0.67% yeast nitrogen base without amino acids) supplemented with appropriate amino acids and with 2% of 
55 soluble starch when indicated. 

_S^ cereyisiae strain YNN27 (_U_R_A3;52 ( TRPJ : 2_8_9, GAL2) (Stinchomb et al 1 980); jC [actis strain SD11 JRPJ (Das 
and HoMenbe~rg,~1982); S. pombe strains LEU1-32, HIS5-303 (obtained from W.D. Heyer): These strains were grown 
under conditions as described above for Schwannjqmyces occidentalis with the replacement of 2% soluble starch by 
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2% glucose, where indicated. 

The culture media for S_ pccjdenjali_s and Hansenuja pplyrnprpjia were buffered with 0.2 M NaP0 4 buffer pH 6.2; 
those for S. cereyisiae and K. [actjs were buffered with 0.1 M citrate buffer pH 6 and those for S. pombe with 0.05 M 
acetate buffer pH 6. 

s The pH optimum for the ceilulase (Joliff et al., 1 986a), a-amylase and glucoamylase (Wilson and ingledew, 1 982) 

is approx. pH 6. 

EiCglLHBlQI (Bolivar et al., 1977) was grown in LB medium (Maniatis et al., 1982) with penicillin G (150 ug/ml). 
3. Mjsc_eHanepus metho_ds 

10 

Plasmid DNA was purified either by CsCI gradient centrifugation (Maniatis et al., 1982) or by rapid alkaline extrac- 
tion of plasmid DNA described by Birnboim and Doly (1979). 

Yeast crude extracts were prepared by the method of Schatz (1979). Yeast minilysates were prepared by the 
method of Sherman et al., (1983). 
is DNA fragments were isolated by the "low melting agarose" procedure as described by Gafner et al., (1 983). South- 

ern hybridization was carried out as described by Southern (1 975). 

Yeast transformation was performed according to Klebe et al. (1983) or Ito et al. (1983). The transformants or 
integrants were tested for expression and secretion of ceilulase using either the qualitative congo red assay (Teather 
and Wood, 1 982: halo formation on carboxymethylcellulose agar plates) or a quantitative ceilulase assay according to 
20 Joliff et al. (1 986a). One unit (U) is defined as the amount of enzyme that released 1 umol para-nitrophenol per min at 
60°C. Western analysis was performed as described by Towbin et al. (1979). 

The transformants or integrants were tested for expression and secretion of a-amylase using either a standardized 
a-amylase enzyme test from Merck, W. Germany or a qualitative starch degradation test (halo formation on starch 
agar plates after staining with iodine), as follows : in this test the rate of formation of 2-chloro-4-nitrophenol is determined 
25 by photometry at 405 nm in 0.1 M potassium phosphate at 37°C. The enzyme unit (U) is defined as the amount of 
enzyme catalysing the formation of 1 9imol 2-chloro-4-nitrophenol per min at 37°C. 

Glucoamylase activity was measured by a stop assay method: 
After incubation of the sample in 10% soluble starch in 0.05M KH 2 P0 4 -NaOH (pH 5.0) at 50°C the amount of glucose 
produced is determined by the glucose dehydrogenase method (system glucose Merck). The quantity of NADH formed 
30 is proportional to the glucose concentration. The enzyme unit (U) is defined as the amount of enzyme catalysing the 
formation of 1 umol glucose/min at 50° C. 



TABLE 1 



Transform ant 


OD 600 nm 


EDG-activity mU/ml 


NGA58: 


2.3 


0.12 


pMPGCM (0.3 kb promoter) 


5.3 


0.73 


NGA58: 


2.0 


0.53 


pMPGC1-2 (1.3 kb GAM promoter) 


7.9 


1.83 


NGA58: 


3.0 


0.52 


pMPAGC ADH1 promoter 


5.9 


1.07 


NGA58: 


2.2* 


0.40* 


pMPPGC 


4.5* 


0.64* 


PDC1 -promoter 


2.9 


0.19 




7.2 


0.31 


Transformants were grown in YNB, 0.5% Casaminoacids, 2% maltose, 20 mg/l adenine and 0.2 M NaP buffer pH 6.2 



* : values for transformants grown in 4% glucose 
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40 Claims 

Claims for the following Contracting States : AT, BE, CH, LI, DE, FR, GB, GR, IT, LU, NL, SB 

45 1 . A yeast cell of the genus Schwanniomyces , 
characterized in that 

said yeast cell contains at least one expression cassette comprising : 

a) a first DNA sequence serving as a regulon, 

so 

b) optionally a second DNA sequence coding for a signal peptide 

c) a third DNA sequence coding for a foreign protein, and 
55 d) optionally a fourth DNA sequence serving as a terminator, 

wherein the first, second and fourth DNA sequence are derived from yeast. 
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2. The yeast cell according to claim 1 , 
characterlz d In that 

said first and/or second and/or fourth DNA sequence are derived from a gene coding for an amylolytic enzyme. 

s 3. The yeast cell according to claim 2, 
characterized in that 

said amylolytic enzyme is an a-amylase or glucoamylase. 

4. The yeast cell according to any of claims 1 to 3, 
io characterized in that 

said first and/or second and/or fourth DNA sequences are derived from a yeast of a member of the genera Debar- 
iomvces , Saccharomycopses , Lipomyces, Pichia , or Saccharomyces , preferably Schwanniomyces. 

5. The yeast cell according to claim 4, 
is characterized in that 

said yeast is Schwanniomyces occidentalis. 

6. The yeast cell according to any of claims 1 to 5, 
characterized In that 

20 said first DNA sequence is comprised by a 1 .8 kb Bglil -Xho l-fragment, if derived from Schwanniomyces occidentalis 

a-amylase gene, or a 1 .3 kb Bam HI -Pvu ll fragment if derived from Schwanniomyces occidentalis glucoamylase 
gene. 

7. The yeast cell according to claim 5, 
25 characterized in that 

said first DNA sequence is comprised by part or all of the DNA sequence corresponding to bases -1 to -540 of the 
region preceeding the structural gene coding for a-amylase or part or all of the DNA sequence corresponding to 
bases -1 to -320 of the region preceeding the structural gene coding for glucoamylase. 

30 8. The yeast cell according to any of claims 1 to 5, 
characterized in that 

said first DNA sequence corresponds to one of the regulons regulating expression of: 

a) ADH1.ADH2, PDC1, GAL1/10, PGK, GAPDH, wherein the regulon is preferably obtained from Saccharo- 
35 myces cerevisiae, 

b) LAC4, wherein the regulon is preferably obtained from Kluvveromyces lactis, or 

c) corresponding regulons, preferably obtained from Schwanniomyces. 

40 

9. The yeast cell according to any of claims 1 to 8, characterized in that said second DNA sequence is coding for 
all or part of at least one of the following peptides : 

45 a) MetArgPheSerThrGluGlyPheThrSerLysValValAlaAlalle 

LeuAlaPheSerArgLeuVa lSer Ala ; 



50 



55 



b) MetArgPheSerThrGluGlyPheThrSerLysValValAlaAlalle 
LeuAlaPheSerArgLeuValSerAlaGlnProIlellePheAspMetArg; 
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c ) Met I lePheLeuLysLeuI leLy sSer I leva 1 1 leGlyLeuGlyLeu 
ValSerAla; 

d) MetllePheLeuLysLeuIleLysSerlleVallleGlyLeuGlyLeu 
ValSerAlalleGlnAlaAlaProAla; 

e) MetllePheLeuLysLeuIleLysSerlleVallLeGlyLeuGlyLeu 
ValSerAlalleGlnAlaAlaProAlaSerSerlleGlySerSerAlaSerAla 

10. A yeast cell according to claim 9, 
characterized In that 

the DNA sequence coding for said peptide corresponds to part or all of one of the following DNA sequences: 

a ) ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATT 
TTAGCATTCTCAAGATTGGTATCTGCT , 

b) ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATTT 

TAGCATTCTCAAGATTGGTATCTGCTCAACCGATTATTTTTGACATGAGA; 



C ) ATGATTTTTCTGAAG CTG ATTAAAAGTAT AGTAATTGGTTTGGGATTA 
GTTAGTGCT; 

d ) ATGATTTTTCTGAAG CTG ATTAAAAGTAT AGTAATTGGTTTGGGATTA 
GTTAGTGCTATCCAAGCAGCCCCTGCC ; 

e ) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCTATCCAAGCAGCCCCTGCCTCTTCGATTGGATCTAGTGCTTCA 
GCA. 

11. The yeast cell according to any of claims 1 to 10, 
characterized In that 

said third DNA sequence coding for a foreign protein is a natural or synthetic DNA sequence encoding a cellulase, 
interleukin, insulin like growth factor, interferon, lymphokine, human growth factor, nerve growth factor, aprotinin, 
insulin, hirudin, hormones, blood clotting factors, hepatitis B surface or core antigens, viral or bacterial vaccines 
or human granulocyte/macrophage colony stimulating factor. 

12. The yeast cell according to any of claims 1 to 11 , 
characterized in that 
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said fourth DNA sequence originates from a yeast of a member of the genera Saccharomvces, Pichia , Hansenula , 
preferably Schwanniomvces . 

13. The yeast cell according to claim 12, 
5 characterized in that 

said fourth DNA sequence corresponds to part or all of the terminator comprised by nucleotides 1537 to 1740 of 
the Schwanniomvces ct-amylase gene or part or all of the terminator comprised by nucleotides 2875 to 3320 of 
the Schwanniomvces glucoamylase gene. 

to 14. The yeast cell according to any of claims 1 to 1 3, 
characterized in that 

said first and/or second and/or third and/or fourth DNA sequence has been modified by insertion or deletion or 
substitution of one or more nucleotides while the biological activity of the DNA sequence is retained or improved. 

is 15. The yeast cell according to any of claims 1 to 14, 
characterized In that 

said expression cassette is carried by a circular or linear vector, which optionally comprises one or more selective 
marker genes. 

20 16. The yeast cell according to claim 15, 
characterized in that 

said vector comprises any of the following selective marker genes : TRP5, LEU2, ADE1, ADE2, HIS3, HIS4, URA3, 
LYS2, preferably obtained from Saccharomyces, Hansenula Pichia or Schwanniomvces , AMY1 or GAM1 , prefer- 
ably obtained from Schwanniomvces, 

25 

17. The yeast cell according to any of claims 15 and 16, 
characterized in that 

said vector further comprises a DNA sequence capable of controlling autonomous replication and stable mainte- 
nance of the vector in the host organism. 

30 

18. The yeast cell according to claim 17 
characterized In that 

said DNA sequence is derived from the genome of a member of the genera Saccharomvces , Pichia , Hansenula 
or Kluweromvces , preferably Schwanniomvces . 

3$ 

19. A yeast cell according to any of claims 17 or 18, 
characterized in that 

said DNA sequence is selected from the autonomously replicating sequences ARS, preferably an ARS derived 
from the respective host organism. 

40 

20. The yeast cell according to claim 1 9, 
characterized In that 

said DNA sequence is SwARS 1 or SwARS2, capable of controlling autonomous replication and stable maintenance 
of the vector in Schwanniomvces , wherein SwARS 1 is comprised on a Clal-BamHI fragment adjacent to the EcoRI 
45 fragment shown in Fig. 2a, and SwARS2 is comprised on an EcoRI-Bglll fragment contained in the EcoRI fragment 

shown in Fig. 2a. 

21. The yeast cell according'to any of claims 17 to 20, 
characterized in that 

so said DNA sequence has been modified by deletion, insertion or substitution of one or more nucleotides, while 

retaining or improving its biological function. 

22. The yeast cell according to any of claims 1 to 21 , 
characterized in that 

55 said expression cassette is integrated into the genome, optionally in multiple copies, preferably via homologous 

recombination. 

23. A process for the production of polypeptide, wherein a yeast host organism is cultivated under suitable conditions 
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and the polypeptide is recovered in a manner known per se, 
charact rlz d In that 

said yeast host organism is a yeast cell according to any of claims 1 to 22. 

5 24. The process according to claim 23 
characterized in that 

yeast cells of the species Schwanniomyces occidentalis are used as a host organism and are preferably cultivated 
on starch dextrin, maltose and/or plant biomass. 

10 25. The process according to any of claims 23 or 24, 
characterized in that 

yeast cells of the genus Schwanniomyces are used for the production of single cell protein. 

26. The process according to any of claims 23 to 25, 
1$ characterized In that 

said polypeptide is amylase or glucoamylase. 

27. Use of a yeast cell according to any of claim 1 to 22 for transformation and expression of foreign genes. 
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Claims for the following Contracting State: ES 



1 . A process for preparing a yeast cell capable of expressing foreign protein, 
characterized in that 

25 at least one expression cassette comprising: 

a) a first DNA sequence serving as a regulon, 

b) optionally a second DNA sequence coding for a signal peptide 

30 

c) a third DNA sequence coding for a foreign protein, and 

d) optionally a fourth DNA sequence serving as a terminator, 

35 is inserted into a yeast cell of the genus Schwanniomyces, wherein the first, second and fourth DNA sequence 

are derived from yeast. 

2. The process according to claim 1 , 
characterized In that 

40 said first and/or second and/or fourth DNA sequence are derived from a gene coding for a amylolytic enzyme. 

3. The process according to claim 2, 
characterized in that 

said amylolytic enzyme is an a-amyiase or glucoamylase. 

4. The process according to any of claims 1 to 3, 
characterized in that 

said first and/or second and/or fourth DNA sequences are derived from a yeast of a member of the genera Debar- 
iomvces , Saccharomvcopses , Lipomyces , Pichia , or Saccharomyces , preferably Schwanniomyces . 

5. The process according to claim 4, 
characterized in that 

said yeast is Schwanniomyces occidentalis . 

55 6. The process according to any of claims 1 to 5, 
characterized In that 

said first DNA sequence is comprised by a 1 .8 kb Bglll -Xho l-fragment, if derived from Schwanniomyces occidentalis 
a-amylase gene, or a 1.3 kb Bam HI -Pvu ll fragment if derived from Schwanniomyces occidentalis glucoamylase 
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gene. 

7. The process according to claim 5, 
characterized in that 

said first DNA sequence is comprised by part or all of the DNA sequence corresponding to bases -1 to -540 of the 
region proceeding the structural gene coding for a-amylase or part or all of the DNA sequence corresponding to 
bases -1 to -320 of the region proceeding the structural gene coding for glucoamylase. 

8. The process according to any of claims 1 to 5, 
characterized In that 

said first DNA sequence corresponds to one of the regulons regulating expression of: 

a) ADH1.ADH2, PDC1, GAL1/10, PGK, GAPDH, wherein the regulon is preferably obtained from Saccharo- 
myces cerevisiae, 

b) LAC4, wherein the regulon is preferably obtained from Kluyveromyces lactis, or 

c) corresponding regulons, preferably obtained from SchwannjqmYces. 

9. The process according to any of claims 1 to 8, 
characterized in that 

said second DNA sequence is coding for all or part of at least one of the following peptides : 

a) MetArgPheSerThrGluGlyPheThrSerLysValValAlaAlalle 
LeuAlaPheSerArgLeuValSerAla ; 

b) MetArgPheSerThrGluGlyPheThrSerLysValValAlaAlalle 
LeuAlaPheSerArgLeuValSerAlaGlnProIlellePheAspMetArg; 

c) MetllePheLeuLysLeuIleLysSerlleVallleGlyLeuGlyLeu 
ValSerAla; 

d) MetllePheLeuLysLeuIleLysSerlleVallleGlyLeuGlyLeu 
ValSerAlalleGlnAlaAlaProAla; 

e) MetllePheLeuLysLeuIleLysSerlleVallLeGlyLeuGlyLeu 
ValSerAlalleGlnAlaAlaProAlaSerSerlleGlySerSerAlaSerAla 

10. A process according to claim 9, 
characterized in that 

the DNA sequence coding for said peptide corresponds to part or all of one of the following DNA sequences: 

a ) ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATT 
TTAGCATTCTCAAGATTGGTATCTGCT , 
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b ) ATGAGATTTTCAACTG AAGG ATTTACAAGTAAAGTTGTTGCAGCAATTT 
TAGCATTCTCAAGATTGGTATCTGCTCAACCGATTATTTTTGACATGAGA; 



c ) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCT; 

d ) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCTATCCAAGCAGCCCCTGCC ; 

e ) ATGATTTTTCTG AAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 

GTTAGTGCTATCCAAGCAGCCCCTGCCTCTTCGATTGGATCTAGTGCTTCA 
GCA. 

11. The process according to any of claims 1 to 10, 
characterized in that 

said third DNA sequence coding for a foreign protein is a natural or synthetic DNA sequence encoding a cellulase, 
inter leukin , insulin like growth factor, interferon, lymphokine, human growth factor, nerve growth factor, aprotinin, 
insulin, hirudin, hormones, btood clotting factors, hepatitis B surface or core antigens, viral or bacterial vaccines 
or human granulocyte/macrophage colony stimulating factor. 

12. The process according to any of claims 1 to 11 , 
characterized in that 

said fourth DNA sequence originates from a yeast of a member of the genera Saccharomvces , Pichia , Hansen ula , 
preferably Schwanniomyces . 

13. The process according to claim 12, 
characterized in that 

said fourth DNA sequence corresponds to part or all of the terminator comprised by nucleotides 1537 to 1740 of 
the Schwanniomyces a-amylase gene or part or all of the terminator comprised by nucleotides 2875 to 3320 of 
the Schwanniomyces glucoamytase gene. 

14. The process according to any of claims 1 to 1 3, 
characterized in that 

said first and/or second and/or third and/or fourth DNA sequence has been modified by insertion or deletion or 
substitution of one or more nucleotides while the biological activity of the DNA sequence is retained or improved. 

15. The process according to any of claims 1 to 14, 
characterized in that 

said expression cassette is carried by a circular or linear vector, which optionally comprises one or more selective 
marker genes. 

16. The process according to claim 15, 
characterized in that 

said vector comprises any of the following selective marker genes : TRP5, LEU2, ADE1, ADE2, HIS3, HIS4, URA3, 
LYS2, preferably obtained from Saccharomvces , Hansenula Pichia or Schwanniomyces , AMY1 or GAM1 , prefer- 
ably obtained from Schwanniomyces . 

17. The process according to any of claims 1 5 and 16, 
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charact rlz din that 

said vector further comprises a DNA sequence capable of controlling autonomous replication and stable mainte- 
nance of the vector in the host organism. 

18. The process according to claim 17 
characterized in that 

said DNA sequence is derived from the genome of a member of the genera Saccharomyces , Pichia , Hansenula 
or Kluyveromyces , preferably Schwanniomyces. 

19. A process according to any of claims 17 or 18, 
characterized in that 

said DNA sequence is selected from the autonomously replicating sequences ARS, preferably an ARS derived 
from the respective host organism. 

20. The process according to claim 1 9, 
characterized in that 

said DNA sequence is SwARSI or SwARS2, capable of controlling autonomous replication and stable maintenance 
of the vector in Schwanniomyces, wherein SwARSI is comprised on a Clal-BamHI fragment adjacent to the EcoRI 
fragment shown in Fig. 2a, and SwARS2 is comprised on an EcoRI-Bglll fragment contained in the EcoRI fragment 
shown in Fig 2a. 

21. The process according to any of claims 1 7 to 20, 
characterized In that 

said DNA sequence has been modified by deletion, insertion or substitution of one or more nucleotides, while 
retaining or improving its biological function. 

22. The process according to any of claims 1 to 21, 
characterized in that 

said expression cassette is integrated into the genome, optionally in multiple copies, preferably via homologous 
recombination. 

23. A process for the production of polypeptide, wherein a yeast host organism is cultivated under suitable conditions 
and the polypeptide is recovered in a manner known per se, 

characterized in that 

said yeast host organism is a yeast cell according to any of claims 1 to 22. 

24. The process according to claim 23 
characterized In that 

yeast cells of the species Schwanniomyces occidentalis are used as a host organism and are preferably cultivated 
on starch dextrin, maltose and/or plant biomass. 

25. The process according to any of claims 23 or 24, 
characterized in that 

yeast cells of the genus Schwanniomyces are used for the production of single cell protein. 

26. The process according to any of claims 23 to 25, 
characterized In that 

said polypeptide is amylase or glucoamylase. 

27. Use of a yeast cell according to any of claim 1 to 22 for transformation and expression of foreign genes. 



Patentanspruche 

Patentanspruch fur folgende Vertragsstaaten: AT, BE, CH, LI, DE, FR, GB, GR, IT, LU, NL, SE 

1. Hefezelle der Gattung Schwanniomyces, dadurch gekennzeichnet, daB die Hefezelle mindestens eine Expressi- 
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10 



so 



onskassette enthalt, umfassend: 

a) eine erste DNA-Sequenz, die als Regulon dient, 

b) gegebenenfatls eine zweite DNA-Sequenz, die fur ein Signalpeptid kodiert, 

c) eine dritte DNA-Sequenz, die fur ein Fremdprotein kodiert, und 

d) gegebenenfalls eine vierte DNA-Sequenz, die als ein Terminator dient, 
wobei die erste, zweite und vierte DNA-Sequenz von Hefe abgeleitet sind. 



2. Hefezelle nach Anspruch 1 , dadurch gekennzeichnet, daG die erste und/oder zweite und/oder vierte DNA-Sequenz 
von einem fur ein amylolytisches Enzym kodierenden Gen abgeleitet sind. 

15 

3. Hefezelle nach Anspruch 2, dadurch gekennzeichnet, daG das amylolytische Enzym eine ct-Amylase oder Gluco- 
amylase ist. 

4. Hefezelle nach einem der Anspruche 1 bis 3, dadurch gekennzeichnet, daG die ersten und/oder zweiten und/oder 
20 vierten DNA-Sequenzen von einer Hefe eines Angehdrigen der Gattunqen Debariomvces , Saccharomycopses , 

Lipomyces , Pichia oder Saccharomvces, bevorzugt Schwanniomvces , abgeleitet sind. 

5. Hefezelle nach Anspruch 4, dadurch gekennzeichnet, daG die Hefe Schwanniomvces occidentalis ist. 

25 6. Hefezelle nach einem der Anspruche 1 bis 5, dadurch gekennzeichnet, daG die erste DNA-Sequenz von einem 
1,8 kb BpJI I -Xhol -Fragment umfaGt ist, wenn sie vom Schwanniomvces occidentalis -a-Amylasegen abgeleitet ist, 
oder von einem 1,3 kb Bam HI -Pvu il -Fragment, wenn sie von einem Schwanniomvces occidentalis -Glucoamyla- 
segen abgeleitet ist. 

30 7. Hefezelle nach Anspruch 5, dadurch gekennzeichnet, daG die erste DNA-Sequenz von einem Teil oder der ge- 
samten DNA-Sequenz umlaGt ist, die den Basen -1 bis -540 des Bereiches entspricht, der dem fur ot-Amylase 
kodierenden Strukturgen vorangeht, oder einem Teil oder der Gesamtheit der DNA-Sequenz, die den Basen -1 
bis -320 des Bereiches entspricht, der dem fur Glucoamylase kodierenden Strukturgen vorangeht. 

35 8. Hefezelle nach einem der Anspruche 1 bis 5, dadurch gekennzeichnet, daG die erste DNA-Sequenz einem der 
Regulons entspricht, die die Expression regulieren von 

a) ADH1, ADH2, PDC1, GAL1/10, PGK, GAPDH, wobei das Regulon bevorzugt von Saccharomvces cerevi- 
siae erhalten ist, 

40 

b) LAC4, wobei das Regulon bevorzugt von Kluyveromvces lactis erhalten ist, oder 

c) entsprechenden Regulons, bevorzugt aus Schwanniomvces erhalten. 

45 9. Hefezelle nach einem der Anspruche 1 bis 8, dadurch gekennzeichnet, daG die zweite DNA-Sequenz fur die Ge- 
samtheit oder einen Teil mindestens eines der folgenden Peptide kodiert: 



a) MetArgPheSerThrGluGlyPheThrSerLysValVaiAlaAlalle 
LeuAlaPheSarArgLeuValSerAla ; 



5 5 b) MetArgPheSerThrGluGlyPheThrSerLysValValAIaAlaXl® 

LeuAlaPheSerArgLeuValSerAlaGlnProIleIlePh<aAspMetArg; 
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c) MetllePheLeuLysLeuIleLysSerlleVallleGlyL uGlyLeu 
ValS rAla; 

d) MetllePheLeuLysLeuIleLysSerlleVallleGlyLduGlyLeu 
ValSerAlalleGlnAlaAlaProAla ; 

e) MetllaPheLeuLysLeuIleLysSerlleVallLeGlyLeuGlyLeu 
ValSerAlalleGlnAlaAlaProAlaSerSerlleGlySerSerAlaSerAla 

10. Hefezelle nach Anspruch 9, dadurch gekennzeichnet, daB die fur das Pepttd kodierende DNA-Sequenz einem Teil 
oder der Gesamtheit einer der folgenden DNA-Sequenzen entspricht: 

a ) ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATT 
TTAGCATTCTCAAGATTGGTATCTGCT , 

b) ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGGAGCAATTT 
TAGCATTCTCAAGATTGGTATCTGCTCAACCGATTATTTTTGACATGAGA; 

c ) ATGATTTTTCTGAAGCTG ATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCT ; 

d ) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCTATCCAAGCAGCCCCTGCC ; 

e ) ATGATTTTTCTGAAGCTG ATTAAAAGTATAGTAATTGGTTTGGGATTA 

GTTAGTGCTATCCAAGCAGCCCCTGCCTCTTCGATTGGATCTAGTGCTTCA 
GCA. 

11. Hefezelle nach einem der Anspruche 1 bis 10, dadurch gekennzeichnet, daB die dritte DNA-Sequenz, die fur ein 
Fremdprotein kodiert, eine natOrliche oder synthetische DNA-Sequenz ist, die fur eine Zellulase, Interleukin, insu- 
linahnlichen Wachstumsfaktor, Interferon, Lymphokin, menschlichen Wachstumsfaktor, Nervenwachstumsfaktor, 
Aprotinin, Insulin, Hirudin, Hormone, Blutgerinnungsfaktoren, Hepatitts-B-Oberflachen-oder-Kernantigene, virale 
oder bakterielle Impfstoffe oder humanen Granulocyten/Makrophagen-Kolonie-stimulierenden Faktor kodiert. 

12. Hefezelle nach einem der Anspruche 1 bis 11, dadurch gekennzeichnet, daB die vierte DNA-Sequenz von einer 
Hefe eines Angehorigen der Gattungen Saccharomyces , Pichia , Hansenula , bevorzuat Schwanniomyces , stammt. 

13. Hefezelle nach Anspruch 12, dadurch gekennzeichnet, daB die vierte DNA-Sequenz einem Teil oder der Gesamt- 
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heit des Terminators entspricht, der von den Nukleotiden 1537 bis 1740 des Schwanniomvces -a-Amvlasegenes 
umfa3t ist, Oder einem Teil Oder der Gesamtheit des Terminators, der von den Nukleotiden 2875 bis 3320 des 
Schwanniomyces-Glucoamylasegenes umfaBt ist. 

14. Hefezelle nach einem der Anspruche 1 bis 13, dadurch gekennzeichnet, daB die erste und/oder zweite und/oder 
dritte und/oder vierte DNA-Sequenz durch Insertion oder Deletion oder Substitution eines oder mehrerer Nukleo- 
tide modifiziert worden ist, wahrend die biologische Aktivitat der DNA-Sequenz beibehalten oder verbessert wird. 

15. Hefezelle nach einem der Anspruche 1 bis 14, dadurch gekennzeichnet, daB die Expressionskassette von einem 
zirkularen oder linearen Vektor getragen wird, der gegebenenfalls ein oder mehrere selektive Markergene umfaBt. 

16. Hefezelle nach Anspruch 1 5 ( dadurch gekennzeichnet, daB der Vektor eines der folgenden selektiven Markergene 
umfaBt: 

TRP5, LEU2, ADE1, ADE2, HIS3, HIS4, URA3, LYS2, bevorzuQt erhalten von Saccharomyces , Hansenula . Pichia 
oder Schwanniomyces , AMY1 oder GAM1, bevorzugt erhalten von Schwanniomyces. 

17. Hefezelle nach einem der Anspruche 15 und 16, dadurch gekennzeichnet, daB der Vektor weiter eine DNA-Se- 
quenz umfaBt, die die autonome Replikation und stabile Beibehaltung des Vektors in dem Wirtsorganismus steuern 
kann. 

18. Hefezelle nach Anspruch 17, dadurch gekennzeichnet, daB die DNA-Sequenz von dem Genom eines Angehortgen 
der Gattungen Saccharomyces , Pichia , Hansenula oder Kluyveromyces abgeleitet ist, bevorzugt Schwannio- 
myces. 

19. Hefezelle nach einem der Anspruche 17 oder 18, dadurch gekennzeichnet, daB die DNA-Sequenz aus den auto- 
nom replizierenden Sequenzen ARS, bevorzugt einer ARS, die von dem jeweiligen Wirtsorganismus abgeleitet 
ist, ausgewahlt ist. 

20. Hefezelle nach Anspruch 1 9, dadurch gekennzeichnet, daB die DNA-Sequenz SwARSI oder SwARS2 ist, die die 
autonome Replikation und stabile Beibehaltung des Vektors in Schwanniomyces steuern konnen, wobei SwARSI 
von einem Clal -Bam HI-Fragment umfaBt ist, das dem in Figur 2a gezeigten EcoRI -Fragment benachbart ist, und 
SwARS2 von einem EcoRI- BgJI I -Fragment umfaBt ist, das in dem in F ig u r 2a gezeigten EcoRI- Fragment enthalten 
ist. 

21. Hefezelle nach einem der Anspruche 17 bis 20, dadurch gekennzeichnet, daB die DNA-Sequenz durch Deletion, 
Insertion oder Substitution eines oder mehrerer Nukleotide modifiziert worden ist, wahrend ihre biologische Funk- 
tion beibehalten oder verbessert ist. 

22. Hefezelle nach einem der Anspruche 1 bis 21 , dadurch gekennzeichnet, daB die Expressionskassette in das Ge- 
nom integriert wird, gegebenenfalls in mehreren Kopien, bevorzugt uber homologe Rekombination. 

23. Verfahren zum Erzeugen von Polypeptid, wobei ein Hefewirtsorganismus unter geeigneten Bedingungen kultiviert 
und das Polypeptid in an sich bekannter Weise gewonnen wird, dadurch gekennzeichnet, daB der Hefeorganismus 
eine Hefezelle gemaB einem der Anspruche 1 bis 22 ist. 

24. Verfahren nach Anspruch 23, dadurch gekennzeichnet, daB Hefezellen der Art Schwanniomyces occidentalis als 
ein Wirtsorganismus verwendet werden und bevorzugt auf Starke, Dextrin, Maltose und/oder Pflanzenbiomasse 
kultiviert werden. 

25. Verfahren nach einem der Anspruche 23 oder 24, dadurch gekennzeichnet, daB Hefezellen der Gattung Schwan- 
niomyces fur die Erzeugung von Einzelzellprotein verwendet werden. 

26. Verfahren nach einem der Anspruche 23 bis 25, dadurch gekennzeichnet, daB das Polypeptid Amylase oder Gly- 
coamylase ist. 

27. Verwendung einer Hefezelle nach einem der Anspruche 1 bis 22 zur Transformation und Expression von Fremd- 
genen. 
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Patentanspruch f Or folgenden Vertragsstaat: ES 

1 . Verfahren zum Herstellen einer Hef ezelle, die Fremdprotein exprimieren kann, dadurch gekennzeichnet, daB min- 
destens eine Expressionskassette in eine Hefezelle der Gattung Schwann iomyces insertiert wird, wobei die Ex- 
pressionskassette umfaBt: 

a) eine erste DNA-Sequenz, die als Reguion dient, 

b) gegebenenfalls eine zweite DNA-Sequenz, die fur ein Signalpeptid kodiert, 

c) eine dritte DNA-Sequenz, die fur ein Fremdprotein kodiert, und 

d) gegebenenfalls eine vierte DNA-Sequenz, die als Terminator dient, 
wobei die erste, zweite und vierte DNA-Sequenz von Hefe abgeleitet sind. 

2. Verfahren nach Anspruch 1, dadurch gekennzeichnet, daB die erste und/oder zweite und/oder vierte DNA-Sequenz 
von einem fur ein amylolytisches Enzym kodierenden Gen abgeleitet sind. 

3. Verfahren nach Anspruch 2, dadurch gekennzeichnet, daB das amylolytische Enzym eine a-Amylase Oder Glu- 
coamylase ist. 

4. Verfahren nach einem der Anspruche 1 bis 3, dadurch gekennzeichnet, daB die ersten und/oder zweiten und/oder 
vierten DNA-Sequenzen von einer Hefe eines Angehdrigen der Gattungen Debariomyces , Saccharomycopses , 
Lipomvces , Pichia oder Saccharomyces, bevorzugt Schwanniomyces, abgeleitet sind. 

5. Verfahren nach Anspruch 4, dadurch gekennzeichnet, daB die Hefe Schwanniomyces occidentalis ist. 

6. Verfahren nach einem der Anspruche 1 bis 5, dadurch gekennzeichnet, daB die erste DNA-Sequenz von einem 
1,8 kb Bgll I -Xhol -Fragment umfaBt ist, wenn sie vom Schwanniomyces occidentalis -a-Amylasegen abgeleitet ist, 
oder von einem 1,3 kb BamHI-Pvu 1 1 -Fragment umfaBt ist, wenn sie vom Schwanniomyces occidental is -Glucoa- 
mylasegen abgeleitet ist. 

7. Verfahren nach Anspruch 5, dadurch gekennzeichnet, daB die erste DNA-Sequenz von einem Teil oder der Ge- 
samtheit der DNA-Sequenz umfaBt ist, die den Basen -1 bis -540 des Bereiches entspricht, der dem fur a-Amylase 
kodierenden Strukturgen vorangeht, oder einem Teit oder der Gesamtheit der DNA-Sequenz, die den Basen -1 
bis -320 des Bereichs entspricht, der dem fur Glucoamylase kodierenden Strukturgen vorangeht. 

8. Verfahren nach einem der Anspruche 1 bis 5, dadurch gekennzeichnet, daB die erste DNA-Sequenz einem der 
Regulons entspricht, die die Expression regulieren von 

a) ADH1, ADH2, PDC1, GAL1/10, PGK, GAPDH, wobei das Reguion bevorzugt von Saccharomyces cerevi- 
siae erhalten ist, 

b) LAC4, wobei das Reguion bevorzugt von Klyveromyces lactis erhalten ist, oder 

c) korrespondierenden Regulons, bevorzugt von Schwanniomyces erhalten. 

9. Verfahren nach einem der Anspruche 1 bis 8, dadurch gekennzeichnet, daB die zweite DNA-Sequenz fur die 
Gesamtheit oder einen Teil mindestens eines der folgenden Peptide kodiert: 

a) MetArgPheSerThrGluGlyPheThrS^rLysValValAlaAlall© 
LeuAlaPheSerArgLeuValSerAla; 



26 



EP 0 394 538 B1 

b) MetArgPheSerThrGluGlyPheThrSerLysValValAlaAlalle 
LeuAlaPheSerArgLeuValSerAlaGlnProIlellePheAspMetArg; 

c) MetllePheLeuLysLaulleLysSerlleVallleGlyLeuGlyLau 
ValSerAla; 

d) MetllePheLeuLysLeuIleLysSerlleVallleGlyLeuGlyLau 
ValSerAlalleGlnAlaAlaProAla ; 

a) MetllePheLeuLysLeuIleLysSerlleVallLeGlyLeuGlyLeu 
ValSerAlalleGlnAlaAlaProAlaSerSerlleGlySerSerAlaSerAla 

10. Verfahren nach Anspruch 9, dadurch gekennzeichnet, dafi die fur das Peptid kodierende DNA-Sequenz einem 
Teil Oder der Gesamtheit einer der folgenden DNA-Sequenzen entspricht: 

a ) ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATT 
TTAGCATTCTCAAGATTGGTATCTGCT , 

b) ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATTT 
TAGCATTCTCAAGATTGGTATCTGCTCAACCGATTATTTTTGACATGAGA; 

C ) ATG ATTTTTCTG AAG CT G ATTAAAAGT ATAGTAATTGGTTTGGGATTA 
GTTAGTGCT; 

d) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCTATCCAAGCAGCCCCTGCC ; 

e ) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 

GTTAGTGCTATCCAAGCAGCCCCTGCCTCTTCGATTGGATCTAGTGCTTCA 

GCA. 

11. Verfahren nach einem der Anspruche 1 bis 10, dadurch gekennzeichnet, daB die dritte DNA-Sequenz, die fur ein 
Fremdprotein kodiert, eine naturliche cder synthetische DNA-Sequenz ist, die fur eine Zellulase, Interleukin, insu- 
linahnlichen Wachstumsfaktor, Interferon, Lymphokin, menschlichen Wachstumsfaktor, Nervenwachstumsfaktor, 



27 



EP 0 394 538 B1 



Aprotinin, Insulin, Hirudin, Hormone, Blutgerinnungsfaktoren, Hepatitis-B-Oberflachen- Oder -Kernantigene, virale 
Oder bakterielle Impfstoffe Oder humanen Granulocyten/Makrophagen-Kolonie-stimulierenden Faktor kodiert. 

12. Verfahren nach einem der Anspruche 1 bis 11, dadurch gekennzeichnet, daB die vierte DNA-Sequenz von einer 
5 Hef e eines Angehdrigen der Gattungen Saccharomyces , Pichia , Hansenula , bevorzugt Schwann iomvces, stammt. 

13. Verfahren nach Anspruch 1 2, dadurch gekennzeichnet, daB die vierte DNA-Sequenz einem Teil oder der Gesamt- 
heit des Terminators entspricht, der von den Nukleotiden 1537 bis 1740 des Schwanniomvces -a-Amylasegenes 
umfaBt ist, oder einem Teit oder der Gesamtheit des Terminators, der von den Nukleotiden 2875 bis 3320 des 

10 Sch wan n iom yc es -G lucoam y lase g en es umfaBt ist. 

14. Verfahren nach einem der Anspruche 1 bis 13, dadurch gekennzeichnet, daft die erste und/oder zweite und/oder 
dritte und/oder vierte DNA-Sequenz durch Insertion oder Deletion oder Substitution eines oder mehrerer Nukleo- 
tide modifiziert worden ist ( wahrend die biologische Aktivitat der DNA-Sequenz beibehalten oder verbessert wird. 

15 

15. Verfahren nach einem der Anspruche 1 bis 14, dadurch gekennzeichnet, daB die Expressionskassette von einem 
zirkularen oder linearen Vektor getragen wird, der gegebenenfalls ein oder mehrere selektive Markergene umfaBt. 

16. Verfahren nach Anspruch 15, dadurch gekennzeichnet, daB der Vektor eines der folgenden selektiven Markergene 
20 umfaBt: 

TRP5, LEU2, ADE1 , ADE2, HIS3, HIS4, URA3, LYS2, bevorzugt erhalten von Saccharomyces , Hansenula , Pichia 
oder Schwanniomyces, AMY1 oder GAM1, bevorzugt erhalten von Schwanniomvces. 

17. Verfahren nach einem der Anspruche 15 und 16, dadurch gekennzeichnet, daB der Vektor weiter eine DNA-Se- 
25 quenz umfaBt, die die autonome Replikation und stabile Beibehaltung des Vektors in dem Wirtsorganismus steuern 

kann. 

18. Verfahren nach Anspruch 1 7, dadurch gekennzeichnet, daB die DNA-Sequenz von dem Genom eines Angehdrigen 
der Gattungen Saccharomyces , Pichia , Hansenula oder Kluyveromvces abgeleitet ist, bevorzugt Schwann io- 

30 mvces. 

19. Verfahren nach einem der Anspruche 17 oder 18, dadurch gekennzeichnet, daB die DNA-Sequenz aus den au- 
tonom repiizierenden Sequenzen ARS, bevorzugt einer ARS, die von dem jeweiligen Wirtsorganismus abgeleitet 
ist, ausgewahlt wird. 

35 

20. Verfahren nach Anspruch 1 9, dadurch gekennzeichnet, daB die DNA-Sequenz SwARSI oder SwARS2 ist, die die 
autonome Replikation und stabile Beibehaltung des Vektors in Schwanniomvces steuern konnen, wobei SwARSI 
von einem Clal -Bam HI-Fragment umfaBt ist, das dem in Figur 2a gezeigten EcoRI -Fragment benachbart ist, und 
SwARS2 von einem EcoRI -Bgll I -Fragment umfaBt ist, das in dem in Figur 2a gezeigten EcoRI-Fragment enthalten 

40 ist. 

21. Verfahren nach einem der Anspruche 17 bis 20, dadurch gekennzeichnet, daB die DNA-Sequenz durch Deletion, 
Insertion oder Substitution eines oder mehrerer Nukleotide modifiziert worden ist, wahrend ihre biologische Funk- 
tion beibehalten oder verbessert wird. 

45 

22. Verfahren nach einem der Anspruche 1 bis 21, dadurch gekennzeichnet, daB die Expressionskassette in das 
Genom integriert ist, gegebenenfalls in mehrfachen Kopien, bevorzugt uber homologe Rekombination. 

23. Verfahren zum Erzeugen eines Polypeptides, wobei ein Hefewirtsorganismus unter geeigneten Bedingungen kul- 
so tiviert und das Polypeptid in an sich bekannter Weise gewonnen wird, dadurch gekennzeichnet, daB der Hefe- 
wirtsorganismus eine Hefezelle gemaB einem der Anspruche 1 bis 22 ist. 

24. Verfahren nach Anspruch 23, dadurch gekennzeichnet, daB Hefezellen der Art Schwanniomvces occidentalis als 
ein Wirtsorganismus verwendet werden und bevorzugt auf Starke, Dextrin, Maltose und/oder Pflanzenbiomasse 

55 kultiviert werden. 

25. Verfahren nach einem der Anspruche 23 oder 24, dadurch gekennzeichnet, daB Hefezellen der Gattung Schwan- 
niomvces fur die Erzeugung von Einzelzellprotein verwendet werden. 
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26. Verfahren nach einem der Anspruche 23 bis 25, dadurch gekennzeichnet, daG das Polypeptid Amylase Oder Gly- 
coamylase ist. 

27. Verwendung einer Hefezelle gemaB einem der Anspruche 1 bis 22 zur Transformation und Expression von Fremd- 
s genen. 



Revendications 

10 

Revendicatlons pour lea Etats contractants suivants : AT, BE, CH, LI, DE, FR, GB, GR, IT, LU, NL, SE 

1. Cellule de levure du genre Schwanniomyces, caracterisee en ce que ladite cellule de levure contient au moins 
une cassette d'expression comprenant : 

15 

a) une premiere sequence d'ADN-servant de reguion, 

b) facultativement, une deuxieme sequence d'ADN codant pour un peptide signal 

c) une troisieme sequence d'ADN codant pour une proline etrangere, et 

d) facultativement, une quatrieme sequence d'ADN servant de terminateur, 

20 

les premiere, deuxieme et quatrieme sequences d'ADN etant obtenues a partir d'une levure. 

2. Cellule de levure selon la revendication 1 , caracterisee en ce que lesdites premiere et/ou deuxieme et/ou qua- 
trieme sequences d'ADN sont obtenues a partir d'un gene codant pour une enzyme amylolytique. 

25 

3. Cellule de levure selon la revendication 2, caracterisee en ce que ladite enzyme amylolytique est une ct-amylase 
ou la glucoamylase. 

4. Cellule de levure selon Tune quelconque des revendications 1 a 3, caracterisee en ce que lesdites premiere et/ 
30 ou deuxieme et/ou quatrieme sequences d'ADN sont obtenues a partir d'une levure d'un 6l6ment des genres 

Debariomyces, Saccharomycopses, Lipomyces, Pichia ou Saccharomyces, de preference Schwanniomyces, 

5. Cellule de levure selon la revendication 4, caracterisee en ce que ladite levure est Schwanniomyces occidentalis. 

35 6. Cellule de levure selon I'une quelconque des revendications 1 a 5, caracterisee en ce que ladite premiere se- 
quence d'ADN est constitute d'un fragment Bgft\-Xho\ de 1,8 kb, si elle est obtenue a partir du gene de la <x- 
amylase de Schwanniomyces occidentalis, ou d'un fragment Ba/nHI-Pvull de 1 ,3 kb si elle est obtenue a partir du 
gene de la glucoamylase de Schwanniomyces occidentalis. 

40 7. Cellule de levure selon la revendication 5, caracterisee en ce que ladite premiere sequence d'ADN est constitute 
en partie ou en totalite dela sequence d'ADN correspondant aux bases -1 a -540 de la region precedant le gene 
structural codant pour la ct-amylase, ou en partie ou en totalite de la sequence d'ADN correspondant aux bases 
-1 a -320 de la region precedant le gene structural codant pour la glucoamylase. 

45 8. Cellule de levure selon I'une quelconque des revendications 1 a 5, caracterisee en ce que ladite premiere se- 
quence d'ADN correspond a I'un des regulons regulant ('expression : 

a) de ADH1, ADH2, PDC1, GAL1/10, PGK, GAPDH, le regulon ttant de preference obtenu a partir de Sac- 
charomyces cerevisiae, 

so b) de LAC4, le rtgulon dtant de preference obtenu a partir de Kluyveromyces iactis, ou 

c) de rtgulons correspondants, obtenus de preference a partir de Schwanniomyces. 

9. Cellule de levure selon I'une quelconque des revendications 1 a 8, caracterisee en ce que ladite deuxieme se- 
quence d'ADN code pour tout ou partie d'au moins I'un des peptides suivants : 

55 
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a) MetArgPheSerThrGluGlyPheThrSerLysValValAlaAlalle 
LeuAlaPheSerArgLeuValSerAla ; 

b) MetArgPheSerThrGluGlyPheThrSerLysValValAlaAlalle 
LexoAlaPheSerArgLeuValSerAlaGlnProIlellePheAspMetArg; 

c) MetllePheLeuLysLeuIleLysSerlleVallleGlyLeuGlyLeu 
ValSerAla; 

d) MetllePheLeuLysLeuIleLysSerlleVallleGlyLeuGlyLeu 
ValSerAlalleGlnAlaAlaProAla ; 

e) MetllePheLeuLysLeuIleLysSerlleVallLeGlyLeuGlyLeu 
ValSerAlalleGlnAlaAlaProAlaSerSerlleGlySerSerAlaSerAla 

10. Cellule de levure selon la revendication 9, caracterisee en ce que la sequence d'ADN codant pour ledil peptide 
correspond a une partie ou a la totalite des sequences d'ADN suivantes : 

a ) ATGAGATTTTCAACTGAAGG ATTTACAAGTAAAGTTGTTGCAGCAATT 
TTAGCATTCTCAAGATTGGTATCTGCT , 

b ) ATGAGATTTTCAACTG AAGG ATTTACAAGTAAAGTTGTTGCAGCAATTT 
TAGCATTCTCAAGATTGGTATCTGCTCAACCGATTATTTTTGACATGAGA; 

C ) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCT; 

d ) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCTATCCAAGCAGCCCCTGCC; 

e ) ATGATTTTTCTGAAG CTGATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCTATCCAAGCAGCCCCTGCCTCTTCGATTGGATCTAGTGCTTCA 

GCA. 

11. Cellule de levure selon Tune quelconque des revendications 1 a 10, caracterisee en ce que ladite troisieme 
sequence d'ADN codant pour une proteine etrangere est une sequence d'ADN naturelle ou synth&ique codant 
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pour une celtulase, une interleukine, un facteur de croissance semblable a I'insuline, un interferon, la lymphokine, 
un facteur de croissance humain, un facteur de croissance des nerfs, t'aprotinine, I'insuline, Phirudine, les hormo- 
nes, les facteurs de coagulation du sang, les antigenes de surface ou de coeur de I'hepatite B, les vaccins viraux 
ou bact6riens, ou le facteur stimulant une colonie de granulocytes/macrophages humain. 

5 

12. Cellule de levure selon I'une quelconque des revendications 1 a 11, caracterisee en ce que ladite quatrieme 
sequence d'ADN provientd'une levure d'un element des genres Saccharomyces, Pichia, Hansenula,de presence 
Schwanniomyces. 

io 13. Cellule de levure selon la revendication 1 2, caracterisee en ce que ladite quatrieme sequence d'ADN correspond 
a une partie ou a la totality du terminateur constitue des nucleotides 1537 a 1740 du gene de la ct-amylase de 
Schwanniomyces, ou a une partie ou a la totalite du terminateur constitue des nucleotides 2875 a 3320 du gene 
de la glucoamylase de Schwanniomyces. 

15 14. Cellule de levure selon I'une quelconque des revendications 1 a 1 3, caracterisee en ce que lesdites premiere et/ 
ou deuxieme et/ou troisieme et/ou quatrieme sequences d'ADN ont 6te modifiees en insurant ou en deietant ou 
en substituant un ou plusieurs nucleotides tout en retenant ou en ameiiorant I'activite biologique de la sequence 
d'ADN. 

20 15. Cellule de levure selon I'une quelconque des revendications 1 a 14, caracterisee en ce que ladite cassette de- 
pression est port6e par un vecteur circulaire ou lineaire, qui comprend facultativement un ou plusieurs genes de 
marqueur de selection. 

16. Cellule de levure selon la revendication 15, caracterisee en ce que ledit vecteur comprend I'un quelconque des 
25 genes de marqueur de selection suivants : 

TRP5, LEU2, ADE1, ADE2, HIS3, HIS4, URA3, LYS2, obtenus de preference a partir de Saccharomyces, Han- 
senula, Pichia ou Schwanniomyces, AMY1 ou GAM1 , obtenus de preference a partir de Schwanniomyces. 

17. Cellule de levure selon Tune quelconque des revendications 15 a 16, caracterisee en ce que ledit vecteur com- 
30 prend de plus une sequence d'ADN capable de controler la replication autonome et le maintien stable du vecteur 

dans I'organisme hote. 

18. Cellule de levure selon la revendication 17, caracterisee en ce que ladite sequence d'ADN est obtenue a partir 
du genome d'un element des genres Saccharomyces, Pichia, Hansenula ou Kluyveromyces, de preference 

35 Schwanniomyces. 

19. Cellule de levure selon I'une quelconque des revendications 17 et 18, caracterisee en ce que ladite sequence 
d'ADN est choisie parmi les sequences se repliquant de facon autonome ARS, de preference une ARS obtenue 
a partir de I'organisme hote respectif. 

40 

20. Cellule de levure selon la revendication 19, caracterisee en ce que ladite sequence d'ADN est SwARSI ou 
SwARS2, capable de contrSler la replication autonome et le maintien stable du vecteur dans Schwanniomyces, 
dans laquelle SwARSI est constituee d'un fragment Cla\BamH\ adjacent au fragment EcoR\ presente a la figure 
2a, et SwARS2 est constituee d'un fragment EcoRI -Bgl\\ contenu dans le fragment EcoRI presente a la figure 2a. 

45 

21. Cellule de levure selon I'une quelconque des revendications 17 a 20, caracterisee en ce que ladite sequence 
d'ADN a 6te modifi6e en deietant, inserant ou substituant un ou plusieurs nucleotides, tout en retenant ou en 
ameiiorant sa fonction biologique. 

so 22. Cellule de levure selon I'une quelconque des revendications 1 a 21 , caracterisee en ce que ladite cassette d'ex- 
pression est int6gree dans le genome, facultativement en plusieurs copies, de preference par recombinaison ho- 
moiogue. 

23. Procede pour la preparation d'un polypeptide, dans lequel un organisme hote de levure est cultive dans des con- 
55 ditions appropriees et le polypeptide est recupere d'une maniere connue d'elle-meme, caracterise en ce que ledit 

organisme hote de levure est une cellule de levure selon I'une quelconque des revendications 1 a 22. 

24. Procede selon la revendication 23, caracterise en ce que des cellules de levure de I'espece Schwanniomyces 
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occidentafis sont utilistes en tant qu'organisme note et sont de preference cultivees sur de I'amidon, de la dextrine, 
du maltose et/ou de la biomasse vtgttale. 

25. Procedt selon I'une quelconque des revendications 23 et 24, caracterise n ce que des cellules de levure du 
s genre Schwanniomyces sont utilisdes pour la production d'une seule prottine de cellule. 

26. Proctdt selon I'une quelconque des revendications 23 a 25, caracterise en ce que ledit polypeptide est I'amylase 
ou la glucoamylase. 

10 27. Utilisation d'une cellule de levure selon Tune quelconque des revendications 1 a 22 pour la transformation et 
('expression de genes Strangers. 



Revendications pour I'Etat contractant suivant: ES 

15 

1. Procedt pour preparer une cellule de levure capable d'exprimer une prottine ttrangere, caracterise en ce qu'au 
moins une cassette d'expression comprenant : 

a) une premiere sequence d'ADN-servant de regulon, 
20 b) facultativement, une deuxieme sequence d'ADN codant pour un peptide signal 

c) une troisieme sequence d'ADN codant pour une proteine ttrangere, et 

d) facultativement, une quatrieme sequence d'ADN servant de terminateur, 

est instrte dans une cellule de levure du genre Schwanniomyces, les premiere, deuxieme et quatrieme sequences 
25 d'ADN etant obtenues a partir d'une levure. 

2. Procede selon la revendication 1 , caracterise en ce que lesdites premiere et/ou deuxieme et/ou quatrieme se- 
quences d'ADN sont obtenues a partir d'un gene codant pour une enzyme amylolytique. 

30 3. Proctdt selon la revendication 2, caracterise en ce que ladite enzyme amylolytique est une a-amylase ou la 
glucoamylase. 

4. Proc6d6 selon I'une quelconque des revendications 1 a 3, caracterise en ce que lesdites premiere et/ou deuxieme 
et/ou quatrieme sequences d'ADN sont obtenues a partir d'une levure d'un tltment des genres Debariomyces, 

35 Saccharomycopses, Lipomyces, Pichia ou Saccharomyces, de preference Schwanniomyces. 

5. Procede selon la revendication 4, caracterise en ce que ladite levure est Schwanniomyces occidentalis. 

6. Proctdt selon Tune quelconque des revendications 1 a 5, caracterise en ce que ladite premiere stquence d'ADN 
40 est constitute d'un fragment Bgl\\ -Xho\ de 1 ,8 kb, si elle est obtenue a partir du gene de la a-amylase de Schwan- 
niomyces occidentalis, ou d'un fragment BamH-Pvul I de 1,3 kb si elle est obtenue a partir du gene de la glucoa- 
mylase de Schwanniomyces occidentalis. 

7. Procede selon la revendication 5, caracterise en ce que ladite premiere stquence d'ADN est constitute en partie 
45 ou en totalite de la sequence d'ADN correspondant aux bases -1 a -540 de la rtgion prtcedant le gene structural 

codant pour la a-amylase, ou en partie ou en totalite de la sequence d'ADN correspondant aux bases -1 a -320 
de la region prectdant le gene structural codant pour la glucoamylase. 

8. Procede selon I'une quelconque des revendications 1 a 5, caracterise en ce que ladite premiere stquence d'ADN 
so correspond a I'un des regulons regulant ('expression : 

a) de ADH1, ADH2, PDC1, GAL1/10, PGK, GAPDH, le regulon etant de preference obtenu a partir de Sac- 
charomyces cerevisiae, 

b) de LAC4, le rtgulon etant de preference obtenu a partir de Kluyveromyces lactis, ou 
55 c) de regulons correspondants, obtenus de preference a partir de Schwanniomyces. 

9. Proctde selon I'une quelconque des revendications 1 a 8, caracterise en ce que ladite deuxieme stquence d'ADN 
code pour tout ou partie d'au moins I'un des peptides suivants : 
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a) MetArgPheSerThrGluGlyPheThrSerLysValValAlaAlalle 
LeuAlaPheSerArgLeuValSerAla ; 

b) MetArgPheSerThrGluGlyPheThrSerLysValValAlaAlalle 
LeuAlaPheSerArgLeuValSerAlaGlnProIlellePheAspMetArg; 

c) MetllePheLeuLysLeuIleLysSerlleVallleGlyLeuGlyLeu 
ValSerAla; 

d) MetllePheLeuLysLeuIleLysSerlleVallleGlyLeuGlyLeu 
ValSerAlalleGlnAlaAlaProAla ; 

e) Met:IlePheLeuLysLeuIleLysSerIleValILeGiyI*euGlyLeu 
ValSerAlalleGlnAlaAlaProAlaSerSerlleGlySerSerAlaSerAla 

10. Precede selon la revendication 9, caracterise en ce que la sequence d'ADN codant pour ledit peptide correspond 
a une partie ou a la totalite d'une des sequences d'ADN suivantes : 

a ) ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATT 
TTAGCATTCTCAAGATTGGTATCTGCT , 

b ) ATGAGATTTTCAACTGAAGGATTTACAAGTAAAGTTGTTGCAGCAATTT 
TAGCATTCTCAAGATTGGTATCTGCTCAACCGATTATTTTTGACATGAGA; 

C ) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCT; 

d ) ATGATTTTTCTGAAGCTGATTAAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCTATCCAAGCAGCCCCTGCC ; 

e ) ATGATTTTT CTG AAG CT G ATT AAAAGTATAGTAATTGGTTTGGGATTA 
GTTAGTGCTATCCAAGCAGCCCCTGCCTCTTCGATTGGATCTAGTGCTTCA 
GCA. 
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11. Proc6d6 selon Tune quelconque des revendications 1 a 10, caract'rlse on ce que ladite troisieme sequence 
d'ADN codant pour une proteine etrangere est une sequences d'ADN naturelle ou synthetique codant pour une 
cellulase, une interleukine, un facteur de croissance semblable a I'insuline, un interferon, la lymphokine, un facteur 
de croissance humain, un facteur de croissance des nerfs, I'aprotinine, I'insuline, I'hirudine, les hormones, les 

5 facteurs de coagulation du sang, les antigenes de surface ou de coeur de I'hepatite B, les vaccins viraux ou bac- 

teriens, ou le facteur stimulant une colonie de granulocytes/macrophages humain. 

12. Procdde" selon Tune quelconque des revendications 1 a 11, caracterlee on co que ladite quatrieme sequence 
d'ADN provient d'une levure d'un Element des genres Saccharomyces, Pichia, Hansenula, de presence Schwan- 

io niomyces. 

13. Precede selon la revendication 12, caracterise en ce que ladite quatrieme sequence d'ADN correspond a une 
partie ou a la totality du terminateur constitue des nucleotides 1537 a 1740 du gene de la a-amylase de Schwan- 
niomyces, ou a une partie ou a la total it e du terminateur constitue des nucleotides 2875 a 3320 du gene de la 

is glucoamylase de Schwa nniomyces. 

14. Proc6de selon Tune quelconque des revendications 1 a 1 3, caracterise en ce que lesdites premiere et/ou deuxie- 
me et/ou troisieme et/ou quatrieme sequences d'ADN ont ete modif tees en insurant ou en deletant ou en substituant 
un ou plusieurs nucleotides tout en retenant ou en am6liorant I'activite biologique de la sequence d'ADN. 

20 

15. Procede selon Tune quelconque des revendications 1 a 14, caracterise on co que ladite cassette d'expression 
est portee par un vecteur circulaire ou lineaire, qui comprend facultativement un ou plusieurs genes de marqueur 
de selection. 

25 16. Procede selon la revendication 15, caracterise en ce que ledit vecteur comprend Tun quelconque des genes de 
marqueur de selection suivants : 

TRP5, LEU2, ADE1, ADE2, HIS3, HIS4, URA3, LYS2, obtenus de preference a partir de Saccharomyces, Han- 
senula, Pichia ou Schwanniomyces, AMY1 ou GAM1 , obtenus de preference a partir de Schwanniomyces. 

30 17. Procede" selon Tune quelconque des revendications 15 a 16, caracterise en ce que ledit vecteur comprend de 
plus une sequence d'ADN capable de controler la replication autonome et le maintien stable du vecteur dans 
I'organisme h6te. 

18. Procede selon la revendication 1 7, caracterise en ce que ladite sequence d'ADN est obtenue a partir du genome 
3$ d'un element des genres Saccharomyces, Pichia, Hansenula ou Kluyveromycos, de preference Schwanniomyces. 

19. Proc6d6 selon Tune quelconque des revendications 17 et 18, caracterise en ce que ladite sequence d'ADN est 
choisie parmi les sequences se repliquant de fagon autonome ARS, de preference une ARS obtenue a partir de 
I'organisme hdte respectif. 

40 

20. Proc6de selon la revendication 19, caracterise en ce que ladite sequence d'ADN est SwARSI ou SwARS2, 
capable de contrdler la replication autonome et le maintien stable du vecteur dans Schwanniomyces, dans laquelle 
SwARSI est constitute d'un fragment C/al -BamH\ adjacent au fragment EcoRI prtsente a la figure 2a, et SwARS2 
est constitute d'un fragment EcoH\-Bgl\\ contenu dans le fragment EcoRI prtsente a la figure 2a. 

45 

21. Precede selon Tune quelconque des revendications 1 7 a 20, caracterise en ce que ladite sequence d'ADN a ete 
modifite en deietant, insurant ou substituant un ou plusieurs nucleotides, tout en retenant ou en ameiiorant sa 
fonction biologique. 

so 22. Proctde selon Tune quelconque des revendications 1 a 21, caracterise en ce que ladite cassette d'expression 
est integrte dans le genome, facultativement en plusieurs copies, de preference par recombinaison homologue. 

23. Proctde pour la preparation d'un polypeptide, dans lequel un organisme h6te de levure est cultive dans des con- 
ditions appropriees et le polypeptide est recupere d'une maniere connue d'elle-m§me, caracterise en ce que ledit 

S5 organisme note de levure est une cellule de levure selon Tune quelconque des revendications 1 a 22. 

24. Procede selon la revendication 23, caracterise en ce que des cellules de levure de I'espece Schwanniomyces 
occidenta lis sont utilisees en tantqu'organisme hole et sontde preference cultiv6essurde I'amidon, de la dextrine, 
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du maltose et/ou de la biomasse veg6tale. 

25. Procede selon Tune quelconque des revendications 23 et 24, caracteris' en c qu des cellules de levure du 
genre Schwanniomyces sont utilisees pour la production d'une seule proline de cellule. 

s 

26. Procdde selon Tune quelconque des revendications 23 k 25, caracterise on ce que ledit polypeptide est I'amylase 
ou la glucoamylase. 

27. Utilisation d'une cellule de levure selon I'une quelconque des revendications 1 & 22 pour la transformation et 
10 I'expression de genes Strangers. 
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